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SCORE Search Results Details for Application 
10627604 and Search Result us-10-627-604- 
2,rup. 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 10627604 and Search Result us-10- 

627-604-2.rup. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title : 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



August 12, 2006, 06:52:40 ; Search time 300 Seconds 

(without alignments) 
635.177 Million cell updates/sec 

US-10-627-604-2 
1115 

1 MLP PAIH FYLLPLAC ILMKS PVQHHRERKRASKSSKHSMS 2 06 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

2849598 seqs, 925015592 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



2849598 



Database : 



UniProt_7. 2: * 
1 : uniprot_sprot : * 
2: uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 



1 1115 

2 1107 

3 1067 

4 1062 



100 .0 
99 .3 
95 .7 
95 .2 



206 
206 
206 
206 



SO SD INHUMAN 
SOSD1_PONPY 
S0SD1_M0USE 
SOSD1 RAT 



Q6x4u4 homo sapien 
Q5r5d2 pongo pygma 
Q9cqn4 mus musculu 
Q642g2 rattus norv 
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5 


96 9 


86 


. 9 


206 


1 


S0SD1_CHICK 


Q6vya3 


gallus gall 


6 


817.5 


73 


. 3 


213 


2 


Q6VYA2_XENLA 


Q6vya2 


xenopus lae 


7 


811.5 


72 


. 8 


213 


2 


Q6 88C5_XENTR 


Q68 8c5 


xenopus tro 


8 


796.5 


71 


. 4 


213 


2 


Q6X4U2_XENLA 


Q6x4u2 


xenopus lae 


9 


74 1 . 5 


66 


. 5 


187 


2 


Q 6 6 KQ 4 _XE NL A 


Q66kq4 


xenopus lae 


1 0 


64 5 


57 


. 8 


20 7 


2 


Q5 6 8H 6_BRARE 


Q56 8h6 


br achy dan io 


11 


615.5 


55 


. 2 


153 


2 


Q 6 VYA 1_XE NL A 


Q6vyal 


xenopus lae 


12 


372 . 5 


33 


. 4 


179 


2 


Q4RYK3_TETNG 


Q4ryk3 


tetraodon n 


13 


349.5 


31 


. 3 


213 


1 


SOST_HUMAN 


Q9bqb4 


homo sapien 


14 


349.5 


31 


. 3 


213 


2 


Q4 95N9_HUMAN 


Q495n9 


homo sapien 


1 5 


34 8 


31 


. 2 


213 


1 


SOST_CERAE 


Q9bg7 8 


cercopithec 


16 


34 0 


30 


. 5 


213 


1 


S0ST_RAT 


Q99p6 7 


rattus norv 


17 


33 3 


29 


. 9 


176 


1 


S0ST_B0VIN 


Q9bg7 9 


bos taurus 


18 


33 1 


29 


. 7 


211 


1 


S0ST_M0USE 


Q99p6 8 


mus musculu 


1 9 


115.5 


10 


. 4 


361 


2 


Q4 RJ6 9_TETNG 


Q4r j6 9 


tetraodon n 


2 0 


114 


10 


. 2 


400 


2 


Q6 PFL 2_BRARE 


Q6pf 12 


brachydanio 


2 1 


113 


10 


. 1 


n o c 
385 


2 


Q6 IR7 9_XENLA 


Q6ir7 9 


xenopus lae 


22 


111 


10 


. 0 


36 9 


2 


Q6 NwA 0_BRARE 


Q6nwa0 


brachydanio 


23 


106.5 


9 


. 6 


1034 


2 


03 588 8_RAT 


035 88 8 


rattus norv 


24 


101 


9 


. 1 


461 


2 


Q6 5 8Q INHUMAN 


Q658ql 


homo sapien 


25 


10 1 


9 


. 1 


1233 


1 


MUC5A_HUMAN 


P98088 


homo sapien 


26 


100 


9 


. 0 


375 


2 


Q9 8TX5_XENLA 


Q98tx5 


xenopus lae 


27 


100 


9 


. 0 


574 


2 


Q8N4M9_HUMAN 


Q8n4m9 


homo sapien 


28 


98 . 5 


8 


. 8 


375 


2 


Q5BL7 4_XENTR 


Q5bl74 


xenopus tro 


29 


97 


8 


. 7 


379 


1 


CYR61_RAT 


Q9es72 


rattus norv 


3 0 


9 7 


8 


. 7 


379 


2 


Q6 6HT5__RAT 


Q66ht5 


rattus norv 


3 1 


97 


8 


. 7 


379 


2 


Q9WTM9_RAT 


Q9wtm9 


rattus norv 


32 


95 . 5 


8 


. 6 


375 


1 


CEF10_CHICK 


P1933 6 


gallus gall 


3 3 


95 


8 


. 5 


379 


1 


CYR61_M0USE 


P18406 


mus musculu 


3 4 


95 


8 


. 5 


379 


2 


Q3TX2 1_M0USE 


Q3tx2 1 


m osteoclas 


J b 


94.5 


8 


. 5 


26 2 


2 


Q7 6C2 7_PAROL 


Q76C2 7 


paralicht hy 


3 6 


93 . 5 


8 


. 4 


988 


2 


097867_PIG 


097867 


sus scrofa 


3 7 


92 . 5 


8 


. 3 


35 9 


2 


Q4 SUZ 2_TETNG 


Q4suz2 


tetraodon n 


3 8 


91.5 


8 


. 2 


1410 


2 


Q2 020 4_CAEEL 


Q20204 


caenorhabdi 


3 9 


91 


8 


. 2 


381 


2 


Q5 3 FA4__HUMAN 


Q53 fa4 


homo sapien 


4 U 


9 l 


o 
o 


. Z 


A O O 

490 




Qb 109 Z_Ah JM 1 K 


ybi 09z 


xenopus tro 


41 


90.5 


8 


.1 


299 


2 


Q7YRR8_PIG 


Q7yrr8 


sus scrofa 


42 


90 


8 


.1 


381 


1 


CYR6 INHUMAN 


000622 


homo sapien 


43 


90 


8 


.1 


381 


2 


Q6FI18_HUMAN 


Q6f il8 


homo sapien 


44 


89 


8 


.0 


984 


2 


Q6 7VW6_ORYSA 


Q67vw6 


oryza sativ 


45 


88.5 


7 


.9 


442 


2 


Q8CCV4_MOUSE 


Q8ccv4 


mus musculu 



ALIGNMENTS 



RESULT 1 
S0SD1_HUMAN 

ID S0SD1_HUMAN STANDARD; PRT ; 206 AA . 

AC Q6X4U4; Q96HJ7; Q9Y3U3 ; 

DT 29-MAR-2005, integrated into Uni Prot KB /Swiss- Pro t . 

DT 29-MAR-2005, sequence version 2. 

DT 07-MAR-2006, entry version 15. 

DE Sclerostin domain- containing protein 1 precursor (Ectodermal BMP 

DE inhibitor) (Ectodin) (Uterine sensi tization-associated gene 1 protein) 

DE (USAG-1) . 

GN Name=SOSTDCl ; Synonyms =USAG1 ; ORFNames=CDA019 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa ; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; Hominidae; 

OC Homo . 
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OX NCBI_TaxID=9 606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RX MEDLINE=22984999; PubMed=146 23 234 ; DOI=10 . 1016/j . ydbio. 2003 . 08 . Oil; 

RA Laurikkala J., Kassai Y. , Pakkasjaervi L. , Thesleff I., Itoh N. ; 

RT "Identification of a secreted BMP antagonist, ectodin, integrating 

RT BMP, FGF, and SHH signals from the tooth enamel knot."; 

RL Dev. Biol. 264:91-105(2003). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

RC TISSUE =Pheochromocytoma; 

RA Liu F., Xu X.R., Qian B.Z., Xiao H. , Chen Z., Han Z.; 

RT "A novel gene expressed in human pheochromocytoma . " ; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RA O' Shaughnessy R.F.L., Yeo W. , Gautier J., Jahoda C.A.B., 

RA Christiano A.M.; 

RT "A novel secreted WNT agonist is a requirement for epithelial - 

RT mesenchymal interactions."; 

RL Submitted (MAR-2003) to the EMBL/GenBank/DDB J databases. 

RN [4] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RX MEDLINE=22737999; PubMed=l 2853 948 ; DOI=10 . 1038/nature01782 ; 

RA Hillier L.W., Fulton R.S., Fulton L.A., Graves T.A. , Pepin K.H., 

RA Wagner -McPherson C, Layman D. , Maas J., Jaeger S., Walker R. , 

RA Wylie K. , Sekhon M . , Becker M.C., O'Laughlin M.D., Schaller M.E., 

RA Fewell G.A., Delehaunty K.D., Miner T.L., Nash W.E., Cordes M., Du H., 

RA Sun H. , Edwards J., Bradshaw-Cordum H. , Ali J., Andrews S. , Isak A., 

RA Vanbrunt A., Nguyen C. , Du F . , Lamar B. , Courtney L., Kalicki J., 

RA Ozersky P., Bielicki L., Scott K. , Holmes A., Harkins R., Harris A., 

RA Strong CM., Hou S., Tomlinson C. , Dauphin -Kohlberg S., 

RA Kozlowicz-Reilly A., Leonard S. , Rohlfing T., Rock S.M., 

RA Tin-Wollam A. -M. , Abbott A., Minx P., Maupin R. , Strowmatt C, 

RA Latreille P., Miller N. , Johnson D. , Murray J., Woessner J. P., 

RA Wendl M.C, Yang S.-P., Schultz B.R., Wallis J.W., Spieth J., 

RA Bieri T.A. , Nelson J.O., Berkowicz N. , Wohldmann P.E., Cook L.L., 

RA Hickenbotham M.T., Eldred J., Williams D. , Bedell J. A., Mardis E.R. , 

RA Clifton S.W., Chissoe S.L., Marra M.A., Raymond C, Haugen E. , 

RA Gillett W., Zhou Y., James R., Phelps K . , Iadanoto S., Bubb K., 

RA Simms E. , Levy R., Clendenning J., Kaul R., Kent W.J., Furey T.S., 

RA Baertsch R.A., Brent M.R., Keibler E. , Flicek P., Bork P., Suyama M., 

RA Bailey J. A., Portnoy M.E., Torrents D., Chinwalla A.T., Gish W.R., 

RA Eddy S.R., McPherson J.D., Olson M.V. , Eichler E.E., Green E.D., 

RA Waterston R.H. , Wilson R.K.; 

RT "The DNA sequence of human chromosome 7."; 

RL Nature 4 24:157-164(2 003). 

RN [5] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

RC TISSUE=Bone marrow; 

RX MEDLINE=22 3882 57 ; PubMed=l 24 77 932 ; DOI=10 . 1 07 3/pnas . 2 42 6038 99 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H. , Derge J.G. , 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C, Hale S. , Garcia A.M., Gay L.J., Hulyk S.W., 
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RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J. , Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [6] 

RP PROTEIN SEQUENCE OF 24-38. 

RX PubMed=15340161; DOI =10 . 1110/ps . 046 82504 ; 

RA Zhang Z. , Henzel W.J.; 

RT "Signal peptide prediction based on analysis of experimentally 

RT verified cleavage sites."; 

RL Protein Sci. 13:2819-2824(2004). 

RN [7] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 74-206. 

RC TISSUE=Brain; 

RG The German cDNA consortium; 

RL Submitted (SEP-2004) to the EMBL/GenBank/DDB J databases. 

RN [8] 

RP FUNCTION, INTERACTIONS WITH BMP2 ; BMP 4 AND BMP7 , SUBCELLULAR LOCATION, 

RP AND TISSUE SPECIFICITY. 

RX PubMed=15020244; DOI=l 0 . 10 16/j .bbrc . 2 004 . 02 . 075 ; 

RA Yanagita M . , Oka M., Watabe T. , Iguchi H. , Niida A., Takahashi S., 

RA Akiyama T. , Miyazono K., Yanagisawa M., Sakurai T.; 

RT "USAG-1: a bone morphogenet ic protein antagonist abundantly expressed 

RT in the kidney."; 

RL Biochem. Biophys . Res. Commun. 316:490-500(2004). 

CC -!- FUNCTION: May be involved in the onset of endometrial receptivity 

CC for implantation/sensitization for the decidual cell reaction 

CC Enhances Wnt signaling and inhibits TGF-beta signaling (By 

CC similarity) . Directly antagonizes activity of BMP2, BMP4, BMP6 and 

CC BMP7 in a dose -dependent manner. 

CC -!- SUBUNIT: Interacts with BMP2 , BMP4 , BMP6 and BMP7 with high 
CC affinity. 

CC SUBCELLULAR LOCATION: Secreted protein. 

CC -!- TISSUE SPECIFICITY: Highly expressed in kidney and weakly in lung. 
CC SIMILARITY: Belongs to the sclerostin family. 

CC -!- SIMILARITY: Contains 1 CTCK (C-terminal cystine knot -like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons At tribut ion-NoDerivs License 

CC 

DR EMBL; AB059270; BAC20331.1; -; mRNA. 

DR EMBL; AF361494; AAL57219.1; -; mRNA. 

DR EMBL; AY255634 ; AAQ83296.1; -; mRNA. 

DR EMBL; AC079155; AAQ96855.1; -; Genomic_DNA. 

DR EMBL; BC008484; AAH08484.1; -; mRNA. 

DR EMBL; AL050024; CAB43243.2; -; mRNA. 

DR PIR; T08710; T08710. 

DR Ensembl; ENSG00000171243 ; Homo sapiens. 

DR HGNC; HGNC:21748; SOSTDC1 . 

DR MIM; 609675; gene. 

DR InterPro; IPR0 06 20 7; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR149 03; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01185; CTCKJL ; FALSE_NEG . 
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DR 


PROSITE; 


PS01225; 


CTCK_2; 1. 








KW 


Direct protein sequencing; Glycoprotein; Signal; 




KW 


Wnt signaling pathway. 








FT 


SIGNAL 


1 


23 








FT 


CHAIN 


24 


2 06 


Sclerostin domain -containing protein 


FT 








/FTId=PRO_00 0003 3180 . 




FT 


DOMAIN 


75 


170 


CTCK. 






FT 


CARBOHYD 


47 


47 


N-linked (GlcNAc 


. . .) 


(Potential ) . 


FT 


CARBOHYD 


173 


173 


N-linked (GlcNAc 


. . .) 


(Potential ; . 


FT 


DISULFID 


75 


133 


By similarity. 






FT 


DISULFID 


89 


147 


By similarity. 






FT 


DISULFID 


100 


163 


By similarity. 






FT 


DISULFID 


104 


165 


By similarity. 






FT 


CONFLICT 


8 


14 


FYLLPLA -> LSLIPLL (in 


Ref. 3) . 


FT 


CONFLICT 


20 


20 


S -> N (in Ref . 


3) . 




FT 


CONFLICT 


41 


41 


P -> S (in Ref. 


3) . 




FT 


CONFLICT 


62 


62 


N -> S (in Ref. 


3) . 




FT 


CONFLICT 


69 


69 


T -> S (in Ref. 


3) . 




FT 


CONFLICT 


106 


106 


P -> L (in Ref. 


7) . 




FT 


CONFLICT 


127 


127 


S -> G (in Ref . 


3) . 




FT 


CONFLICT 


182 


182 


M -> V (in Ref. 


3) . 




FT 


CONFLICT 


188 


188 


V - > A (in Ref. 


3) . 




FT 


CONFLICT 


205 


205 


M -> L (in Ref. 


3) . 




SQ 


SEQUENCE 


2 06 AA; 


23307 MW 


; 9FB3CC41E4B53834 CRC64 ; 


Query Match 




100.0%; 


Score 1115; DB 


1; Length 206; 


Best Local Similarity 


100 .0%; 


Pred. No. 5.1e- 


100; 





Matches 206; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


l 


MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 


60 






1 MM II II II II II II II II III II II II II II II II II II II II MM II II II II II 




Db 


1 


MLP PAIHFYLLPLAC ILMKSCLAF KNDATE ILYSHVVKPVPAHP SSNSTLNQARNGGRHF 


60 


Qy 


61 


SNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 


120 






1 II II II II II M II II II II III M II II II II II II II II II II II II II II II II II 




Db 


61 


SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLPLPVLPNWIGGGYGT 


120 


Qy 


121 


KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 


180 






1 II II II II II II II II II II III II II II II II II II MM II II II II II II II II II 




Db 


121 


KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 


180 


Qy 


181 


SMSPAKPVQHHRERKRASKSSKHSMS 2 06 
1 M II II II M II II II M II III II 




Db 


181 


SMSPAKPVQHHRERKRASKSSKHSMS 2 06 





RESULT 2 
SOSD1 PONPY 



ID SOSDl_PONPY STANDARD; PRT ; 206 AA . 

AC Q5R5D2; 

DT 29-MAR-2005, integrated into UniProtKB/Swi ss -Prot . 

DT 21-DEC-2004, sequence version 1. 

DT 07-FEB-2006, entry version 11. 

DE Sclerostin domain-containing protein 1 precursor. 

GN Name=SOSTDCl; 

OS Pongo pygmaeus (Orangutan) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontogli res ; Primates; Catarrhini; Hominidae; 

OC Pongo. 

OX NCBI_TaxID=9 60 0; 

RN [1] 
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RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

RC TISSUE=Kidney; 

RG The German cDNA consortium; 

RL Submitted (NOV-2004) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Directly antagonizes activity of BMP2 , BMP4 , BMP6 and 
CC BMP7 in a dose -dependent manner. Enhances Wnt signaling and 

CC inhibits TGF-beta signaling. May be involved in the onset of 

CC endometrial receptivity for implantat ion/sensit izat ion for the 

CC decidual cell reaction (By similarity) . 

CC -!- SUBUNIT: Interacts with BMP2 , BMP4 , BMP6 and BMP7 with high 
CC affinity (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Secreted protein (By similarity). 
CC SIMILARITY: Belongs to the sclerostin family. 

CC SIMILARITY: Contains 1 CTCK (C-terminal cystine knot-like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; CR86 0930; CAH93034.1; -; mRNA. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01185; CTCK_1 ; FALSE_NEG . 

DR PROSITE; PS012 25; CTCK_2 ; 1. 

KW Glycoprotein; Signal; Wnt signaling pathway. 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Matches 205; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

I II II II II II II II II II II III II II II II II II II II II II II II II II II II II II 
MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 
I II II II II II II II II II Mill II II II II II II II II II II II I IIIIIIMIMI 



1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 



SIGNAL 


1 


23 




By similarity. 




CHAIN 


24 


206 




Sclerostin domain- 


containing protein 










/FTId=PRO_0000033182 . 


DOMAIN 


75 


170 




CTCK. 




CARBOHYD 


47 


47 




N-linked (GlcNAc. 


. . ) (Potential) . 


CARBOHYD 


173 


173 




N-linked (GlcNAc. 


. . ) (Potential) . 


DISULFID 


75 


133 




By similarity. 




DISULFID 


89 


147 




By similarity. 




DISULFID 


100 


163 




By similarity. 




DISULFID 


104 


165 




By similarity. 




> SEQUENCE 


2 06 AA; 


23297 MW 


; 0F8CCEBE364817E6 


CRC64 ; 


Query Match 




99. 


3%; 


Score 1107; DB 1; 


Length 2 06 ; 


Best Local Similarity 


99. 


5%; 


Pred. No. 3e-99; 





Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



Illlllllllllllllllllllllll 



RESULT 3 
SOSD1 MOUSE 
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ID SOSDl_MOUSE STANDARD; PRT ; 206 AA . 

AC Q9CQN4 / Q8CF09 ; 

DT 29-MAR-2005, integrated into Uni Prot KB /Swiss- Pro t . 

DT 01-JUN-2001, sequence version 1. 

DT 07-MAR-2006, entry version 26. 

DE Sclerostin domain-containing protein 1 precursor (Ectodermal BMP 

DE inhibitor) (Ectodin) (Uterine sensi tization-associated gene 1 protein) 

DE (USAG-1) (Sclerostin-like protein) . 

GN Name=Sostdcl ; Synonyms =Sostl , Usagl ; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa,- Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Mus. 

OX NCBI_TaxID=100 90; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] , AND DEVELOPMENTAL STAGE. 

RC STRAIN=C57BL/6 ; TISSUE=Testis ; 

RX MEDLINE=22505168; PubMed=l 26 17 826 ; DOI=10 . 1016/S1567- 133X ( 02 ) 00022 -4 ; 

RA Menke D.B., Page D.C.; 

RT "Sexually dimorphic gene expression in the developing mouse gonad."; 

RL Gene Expr. Patterns 2:359-367(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [MRNA], FUNCTION, INTERACTIONS WITH BMP 2 ; BMP4 ; 

RP BMP6 AND BMP 7, SUBCELLULAR LOCATION, TISSUE SPECIFICITY, DEVELOPMENTAL 

RP STAGE, AND INDUCTION. 

RX MEDLINE=22984999; PubMed=146 23 234 ; DOI=10 . 1016/ j . ydbio. 2003 . 08 . 011 ; 

RA Laurikkala J., Kassai Y. , Pakkasj aervi L. , Thesleff I., Itoh N. ; 

RT "Identification of a secreted BMP antagonist, ectodin, integrating 

RT BMP, FGF, and SHH signals from the tooth enamel knot."; 

RL Dev. Biol. 264:91-105(2003). 

RN [3] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RC STRAIN=C57BL/6 ; 

RA 0' Shaughnessy R.F.L., Yeo W. , Gautier J., Jahoda C.A.B., 

RA Christiano A.M.; 

RT "A novel secreted WNT agonist is a requirement for epithelial - 

RT mesenchymal interactions . " ; 

RL Submitted (MAR-2003) to the EMBL/GenBank/DDB J databases. 

RN [4] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

RC STRAIN=C57BL/6J; TISSUE=Kidney , and Pancreas; 

RX PubMed=16141072; DOI=10 . 1126/science . 1112014 ; 

RA Carninci P., Kasukawa T. , Katayama S. , Gough J., Frith M.C., Maeda N. , 

RA Oyama R. , Ravasi T., Lenhard B., Wells C. , Kodzius R. , Shimokawa K. , 

RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M., 

RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E., 

RA Ambesi -Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L., 

RA Bansal M., Baxter L. , Beisel K.W., Bersano T. , Bono H., Chalk A.M., 

RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R., 

RA Crowe M.L., Dalla E. , Dalrymple B.P., de Bono B., Delia Gatta G., 

RA di Bernardo D. , Down T . , Engstrom P., Fagiolini M., Faulkner G. , 

RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M. , 

RA Georgii -Hemming P., Gingeras T.R., Gojobori T. , Green R.E., 

RA Gustincich S., Harbers M., Hayashi Y. , Hensch T.K., Hirokawa N. , 

RA Hill D., Huminiecki L. , Iacono M. , Ikeo K . , Iwama A., Ishikawa T. , 

RA Jakt M., Kanapin A., Katoh M., Kawasawa Y . , Kelso J., Kitamura H. , 

RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K., 

RA Kurochkin I.V., Lareau L.F., Lazarevic D. , Lipovich L., Liu J., 

RA Liuni S. , McWilliam S. , Madan Babu M. , Madera M., Marchionni L. , 

RA Matsuda H. , Matsuzawa S. , Miki H. , Mignone F. , Miyake S., Morris K. , 

RA Mottagui-Tabar S., Mulder N. , Nakano N., Nakauchi H., Ng P., 



http://es/ScoreAccessWeb/Get!tem.action?AppId=10627604&seqId=583754&Ite 8/15/06 



SCORE Search Results Details for Application 10627604 and Search Result us- 10-627-6... Page 8 of 25 



RA Nilsson R. , Nishiguchi S., Nishikawa S. , Nori F . , Ohara O., 

RA Okazaki Y. , Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G. , 

RA Petrovsky N. , Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M. , 

RA Rost B., Ruan Y. , Salzberg S.L., Sandelin A., Schneider C, 

RA Schonbach C. , Sekiguchi K. , Semple C.A., Seno S., Sessa L . , Sheng Y., 

RA Shibata Y. , Shimada H. , Shimada K., Silva D., Sinclair B. , 

RA Sperling S., Stupka E. , Sugiura K., Sultana R., Takenaka Y. , Taki K., 

RA Tammoja K. , Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A., 

RA Ueda H.R., van Nimwegen E . , Verardo R., Wei C.L., Yagi K. , 

RA Yamanishi H. , Zabarovsky E . , Zhu S. , Zimmer A . , Hide W. , Bult C, 

RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J., 

RA Wahlestedt C, Mattick J.S., Hume D.A., Kai C, Sasaki D. , Tomaru Y., 

RA Fukuda S., Kanamor i -Katayama M. , Suzuki M., Aoki J., Arakawa T., 

RA Iida J., Imamura K . , Itoh M. , Kato T. , Kawaji H . , Kawagashira N . , 

RA Kawashima T. , Kojima M . , Kondo S., Konno H. , Nakano K., Ninomiya N. , 

RA Nishio T., Okada M . , Plessy C. , Shibata K., Shiraki T., Suzuki S. , 

RA Ta garni M., Waki K. , Watahiki A., Okamura-Oho Y., Suzuki H. , Kawai J., 

RA Hayashizaki Y. ; 

RT "The transcriptional landscape of the mammalian genome."; 

RL Science 309:1559-1563(2005). 

RN [5] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

RC STRAIN=FVB/N ; TI SSUE=Kidney ; 

RX MEDLINE=22388257; PubMed=124 77 932 ; DOI=10 . 1073/pnas .242603899 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J . , Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S. , Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [6] 

RP TISSUE SPECIFICITY, AND DEVELOPMENTAL STAGE. 

RX PubMed=1502024 4; DOI =1 0 . 10 16 / j .bbrc .2 004 . 02 .0 75; 

RA Yanagita M., Oka M . , Watabe T. , Iguchi H. , Niida A., Takahashi S., 

RA Akiyama T. , Miyazono K., Yanagisawa M., Sakurai T.; 

RT "USAG-1: a bone morphogenet ic protein antagonist abundantly expressed 

RT in the kidney."; 

RL Biochem. Biophys. Res. Commun. 316:490-500(2004). 

CC -!- FUNCTION: May be involved in the onset of endometrial receptivity 
CC for implantation/sensitization for the decidual cell reaction. 

CC Enhances Wnt signaling and inhibits TGF-beta signaling (By 

CC similarity). Directly antagonizes activity of BMP 2 , BMP4, BMP6 and 

CC BMP7 in a dose -dependent manner. 

CC -!- SUBUNIT: Interacts with BMP2 , BMP4 , BMP6 and BMP7 with high 
CC affinity. 

CC -!- SUBCELLULAR LOCATION: Secreted protein. 

CC -!- TISSUE SPECIFICITY: Highly expressed in kidney at renal collecting 
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CC ducts level and weakly in brain. 

CC -!- DEVELOPMENTAL STAGE: Expression was first detected at Ell 

CC throughout the surface of the embryo, and it was most intense in 

CC the head region on the surfaces of the mandibular, maxillary, and 

CC frontonasal processes. At Ell. 5 expression is detected in the 

CC first and second branchial arches, pharynx and metanephros. At 

CC E12-E14, expression was intense and strikingly confined to 

CC developing ectodermal organs. The vibrissae, tylotrich hair 

CC follicles, tongue papillae, and tooth germs as well as the ear 

CC auricle. Also expressed intensely in kidney epithelium in the 

CC stalk and tips of ureter as well as in the spermatic ducts in the 

CC testis. At E17.5 strong expression was restricted to kidney 

CC tubules and ameloblasts in teeth, and moderate expression was 

CC observed in hair follicles, choroids plexus of the fourth cerebral 

CC ventricle of the brain. First detected on E12.5 in interstitial 

CC cell of the testis and increased towards E14.5. On 8 dpp (day post 

CC partum) highly expression was detected in kidney and weakly in 

CC skin. 

CC -!- INDUCTION: Up-regulated by BMP2 and BMP 7 . Down-regulated by FGF4 
CC and SHH. 

CC -!- SIMILARITY: Belongs to the sclerostin family. 

CC -!- SIMILARITY: Contains 1 CTCK (C-terminal cystine knot-like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons At tr ibut ion-NoDerivs License 

CC 

DR EMBL; AY134666; AAN08617.1; -; mRNA. 

DR EMBL; AB059271; BAC20332.1; -; mRNA. 

DR EMBL; AY255635; AAQ83297.1; -; mRNA. 

DR EMBL; AK002240; BAB2 1957.1; -; mRNA. 

DR EMBL; AK0023 96; BAB22068.1; -; mRNA. 

DR EMBL; AK007893; BAB25333.1; -; mRNA. 

DR EMBL; AK007935; BAC25193.1; -; mRNA. 

DR EMBL; AK007967; BAB25378.1; -; mRNA. 

DR EMBL; BC021458; AAH21458.1; -; mRNA. 

DR Ensembl; ENSMUSG00000036 16 9 ; Mus musculus . 

DR MGI; MGI:1913292; Sostdcl. 

DR GO; GO:0005615; C : extracel lular space; IDA. 

DR GO; GO:0005515; F:protein binding; IPI. 

DR GO; GO:0030514; P:negative regulation of BMP signaling pathway; IDA. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PR0SITE; PS01185; CTCK_1 ; FALSE_NEG . 

DR PR0SITE; PS012 25; CTCK_2 ; 1. 

KW Glycoprotein; Signal; Wnt signaling pathway. 



FT 


SIGNAL 


1 


23 


By similarity. 




FT 


CHAIN 


24 


206 


Sclerostin domain- 


containing protein 


FT 








/FTId=PRO_00 00 03 3181 . 


FT 


DOMAIN 


75 


170 


CTCK. 




FT 


CARB0HYD 


47 


47 


N-linked (GlcNAc. 


. .) (Potential). 


FT 


CARB0HYD 


173 


173 


N-linked (GlcNAc. 


. . ) (Potential) . 


FT 


DISULFID 


75 


133 


By similarity. 




FT 


DISULFID 


89 


147 


By similarity. 




FT 


DISULFID 


100 


163 


By similarity. 




FT 


DISULFID 


104 


165 


By similarity. 




FT 


CONFLICT 


69 


69 


S -> R (in Ref . 4; 


BAC25193) . 


FT 


CONFLICT 


80 


80 


S -> T (in Ref . 4; 


BAC25193) . 


SQ 


SEQUENCE 


2 06 AA; 


23174 


MW; 70D24819EB06CBC9 


CRC64 ; 
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Query Match 95.7%; Score 1067; DB 1; Length 2 06; 

Best Local Similarity 94.7%; Pred . No. 2.4e-95; 

Matches 195; Conservative 7; Mismatches 4; Indels 0; Gaps 0; 

MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

IIIIIM Ml I II h:| III II II II II II II MM MM II II II II II II II II 

MLPPAIHLSLIPLLCILMRNCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 
SNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 

I M MM MM II II II M M Ml II II II M M II II II II M II MM II M II II II 

SSTGLDRNSRVQVGCRELRSTKYI SDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 
KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 18 0 

I II II M II II II II II II II III M II II II II M II II II M II M II II II II II II 

KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 18 0 
SMSPAKPVQHHRERKRASKSSKHSMS 2 06 

hlllll I II M II M II II III M 



Qy 


l 


Db 


1 


QY 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



RESULT 4 
SOSDl_RAT 

ID SOSDl_RAT STANDARD; PRT ; 206 AA. 

AC Q642G2; Q8CJA4 ; 

DT 29-MAR-2005, integrated into UniProtKB/Swi ss -Prot . 

DT 25-OCT-2004, sequence version 1. 

DT 07-FEB-2006, entry version 13. 

DE Sclerostin domain-containing protein 1 precursor (Uterine 

DE sensitization-associated gene 1 protein) (USAG-1) (Wnt - signaling 

DE modulator) . 

GN Name=Sostdcl ; Synonyms =Usagl , Wise; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodent ia; Sciurognathi ; 

OC Muroidea; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] , FUNCTION, TISSUE SPECIFICITY, AND 

RP INDUCTION. 

RC STRAIN=Sprague-Dawley; TISSUE=Uterus ; 

RX MEDLINE=22277976 ; PubMed=l 23 90 898 ; DOI=10 . 1 095/biolreprod . 102 . 00685 8 ; 

RA Simmons D.G., Kennedy T.G. ; 

RT "Uterine sensitization-associated gene-1: a novel gene induced within 

RT the rat endometrium at the time of uterine recept ivity/sensit izat ion 

RT for the decidual cell reaction."; 

RL Biol. Reprod. 67:1638-1645(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

RC TISSUE=Heart ; 

RG NIH - Mammalian Gene Collection (MGC) project; 

RL Submitted (SEP-2004) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP FUNCTION, SUBCELLULAR LOCATION, TISSUE SPECIFICITY, AND DEVELOPMENTAL 

RP STAGE . 

RX PubMed=153 73 76 4; DOI =10 . 11 11/ j . 00 22 -202X. 20 04 . 2 34 10 .X; 

RA O' Shaughnessy R.F.L., Yeo W. , Gautier J., Jahoda C.A.B., 

RA Christiano A.M.; 

RT "The WNT signalling modulator, Wise, is expressed in an interaction - 

RT dependent manner during hair-follicle cycling."; 

RL J. Invest. Dermatol. 123:613-621(2004). 
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CC -I- FUNCTION: Directly antagonizes activity of BMP2 , BMP4 , BMP6 and 

CC BMP7 in a dose -dependent manner (By similarity) . May be involved 

CC in the onset of endometrial receptivity for 

CC implantation/sensitization for the decidual cell reaction. 

CC Enhances Wnt signaling and inhibits TGF-beta signaling. 

CC -!- SUBUNIT: Interacts with BMP2 , BMP4 , BMP6 and BMP7 with high 

CC affinity (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Secreted protein. 

CC -!- TISSUE SPECIFICITY: Higly expressed within the maximally 

CC sensitized/receptive endometrium. Weakly expressed in brain, 

CC kidney and the female reproductive tract. Expressed in the dermal 

CC papilla (DP) and at high level in the precortex of both anagen 

CC vibrissae and pelage follicles. Dynymic expression during the hair 

CC cycle. 

CC -!- DEVELOPMENTAL STAGE: Highly expressed in epidermis, dermis and the 

CC outermost periderm layer in the 17 day post -coi turn (dpc) . 

CC -!- INDUCTION: Up-regulated at day 5 pregnant or pseudopregnant of the 

CC uterine glandular epithelial cells, at time of maximal 

CC sensitization for the decidual cell reaction. Down-regulated at 

CC day 6 refractory uterus. 

CC -!- SIMILARITY: Belongs to the sclerostin family. 

CC -!- SIMILARITY: Contains 1 CTCK (C- terminal cystine knot-like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons At tribut ion-NoDerivs License 

CC 

DR EMBL; AF411056; AAN45848.1; - ; mRNA. 

DR EMBL; BC081710; AAH81710.1; -; mRNA. 

DR Ensembl; ENSRNOG00000005770; Rattus norvegicus. 

DR RGD; 628877; Sostdcl . 

DR InterPro; IPR0 06 207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01185; CTCK_1 ; FALSE_NEG . 

DR PROSITE; PS012 25; CTCK_2 ; 1. 

KW Glycoprotein; Signal; Wnt signaling pathway. 



FT 


SIGNAL 


1 


23 


By similarity. 




FT 


CHAIN 


24 


206 


Sclerostin domain- 


containing protein 1. 


FT 








/FTId=PRO_0000033183 . 


FT 


DOMAIN 


75 


170 


CTCK. 




FT 


CARBOHYD 


47 


47 


N-linked (GlcNAc. 


. . ) (Potential) . 


FT 


CARBOHYD 


173 


173 


N-linked (GlcNAc. 


. . ) (Potential) . 


FT 


DISULFID 


75 


133 


By similarity. 




FT 


DISULFID 


89 


147 


By similarity. 




FT 


DISULFID 


100 


163 


By similarity. 




FT 


DISULFID 


104 


165 


By similarity. 




FT 


CONFLICT 


187 


187 


P -> L (in Ref . 1) 




SQ 


SEQUENCE 


2 06 AA; 


23136 


MW; 52BD6024 31C6DD15 


CRC64 ; 


Query Match 




95.23 


ir; Score 1062; DB 1; 


Length 2 06; 



Best Local Similarity 94.7%; Pred. No. 7.2e-95; 

Matches 195; Conservative 6; Mismatches 5; Indels 0; Gaps 0; 
Qy 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

IMIIII hll I II Ihl III II II II II II II II II I II II II II MM II MM 

Db 1 MLPPAIHLSLI PLLCILMKNCLAFKNDATEILYSHVVKPVSAHPSSNSTLNQARNGGRHF 60 

Qy 61 SNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 

IHII Mhll MM II II Mill llllll MINI MINI IMIIMIIIM llllll 

Db 61 SSTGLDRNSRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 120 
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Qy 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 18 0 

I II II II MM II II II II II III II II MINI MM II II II II II II II II II II II 

Db 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 18 0 

Qy 181 SMS PAKP VQHHRE RKRASKSS KHS MS 2 06 

MINI I 1 1 1 1 1 1 1 1 1 1 1 1 II h I 

Db 181 S VS PAKP AQHHRE RKRASKSS KHSLS 2 06 



RESULT 5 
SOSD1 CHICK 



ID SOSDl_CHICK STANDARD; PRT ; 206 AA . 

AC Q6VYA3 ; 

DT 29-MAR-2005, integrated into UniProtKB/Swiss-Prot . 

DT 05-JUL-2004, sequence version 1. 

DT 07-FEB-2006, entry version 10. 

DE Sclerostin domain- containing protein 1 precursor (Wnt -signaling 

DE modulator) . 

GN Name=SOSTDCl ; Synonyms =WISE ; 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Ch or data; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID = 9 03 1 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] , FUNCTION, INTERACTION WITH LRP6 , AND 

RP DEVELOPMENTAL STAGE. 

RX MEDLINE=22781984; PubMed=12900447 ; DOI=10 . 1242/dev. 00674 ; 

RA Itasaki N. , Jones CM., Mercurio S. , Rowe A., Domingos P.M., 

RA Smith J.C., Krumlauf R.; 

RT "Wise, a context-dependent activator and inhibitor of Wnt 

RT signalling . " ; 

RL Development 13 0:4295-4305(2003). 

CC -!- FUNCTION: Can activate or inhibit Wnt signaling in a context- 

CC dependent manner. Activates the canonical Wnt pathway whereby acts 

CC through Disheveled proteins and beta-cat enin . Antagonises Wnt 

CC signaling through the canonical pathways presumably by blocking 

CC accessibily of certain WNTs to their receptors. Induces posterior 

CC neural markers via components of the canonical Wnt pathway. 

CC -!- SUBUNIT: Interacts with LRP6 . 

CC -!- SUBCELLULAR LOCATION: Secreted protein. 

CC -!- DEVELOPMENTAL STAGE: Expression was first detected broadly at 
CC stage 9, and then localised in the posterior surface ectoderm 

CC overlying the presomitic mesoderm by stage 10-11 embryo. 

CC -!- SIMILARITY: Belongs to the sclerostin family. 

CC -!- SIMILARITY: Contains 1 CTCK (C-terminal cystine knot-like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AY319926; AAQ57626.1; -; mRNA. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14 9 03 ; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01185; CTCK_1 ; FALSE_NEG . 

DR PROSITE; PS01225; CTCK_2 ; 1. 

KW Glycoprotein; Signal; Wnt signaling pathway. 

FT SIGNAL 1 22 Potential. 

FT CHAIN 23 206 Sclerostin domain-containing protein 1. 
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FT 








/FTId=PRO_0000033184 . 


FT DOMAIN 


75 


170 




CTCK . 




FT CARBOHYD 


47 


47 




N-linked (GlcNAc. . 


. .) (Potential) 


FT CARBOHYD 


173 


173 




N-linked (GlcNAc. , 


. .) (Potential) 


FT DISULFID 


75 


133 




By similarity. 




FT DISULFID 


8 9 


147 




By similarity. 




FT DISULFID 


100 


163 




By similarity. 




FT DISULFID 


104 


165 




By similarity. 




SQ SEQUENCE 


2 06 AA; 


23144 MW 


; 75F438854599CEAD 


CRC64 ; 


Query Match 




86. 


9%; 


Score 96 9; DB 1; 


Length 206; 


Best Local Similarity 


87. 


9%; 


Pred. No. 8e-86; 





Matches 181; Conservative 



6; Mismatches 19; Indels 



0; Gaps 



0; 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

II Mill I III :| II II II MM II II II I II I II I Ml II II II II M 
MLLSAIHFYGLLLACTFTRSYSAFKNDATEILYSHVVKPAPASPSSNSTLNQARNGGRHY 60 

SNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 

* II III II 11 II II II II III MM II MM II II II II II II MM II II II II II 

AGTGSDRNNRVQVGCRELRSTKYISDGQCTSINPLKELVCAGECLPLPLLPNWIGGGYGT 12 0 
KYWSRRSSQEWRC WDKTRTQRIQLQCQDGSTRTYKITVVTACKCKRYTRQHNESSHNFE 18 0 

I II II II II II II II II II II III MM II I II II II II MM II II II II II II M II 

KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSIRTYKITWTACKCKRYTRQHNESSHNFE 18 0 
SMS PAKP VQHHRERKRASKSS KHSMS 2 06 

I I II II MM II II II II III I 

GTSQAKPVQHHKERKRASKSSKHSTS 2 06 



RESULT 6 
Q6VYA2_XENLA 

ID Q6 VYA2_XENLA PRELIMINARY; PRT ; 213 AA . 

AC Q6VYA2 ; 

DT 05-JUL-2004, integrated into Uni ProtKB/TrEMBL . 

DT 05-JUL-2004, sequence version 1. 

DT 07-FEB-2006, entry version 9. 

DE Wise -A. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pi po idea; Pi pi da e; 

OC Xenopodinae; Xenopus; Xenopus. 

OX NCBI_TaxID=8 35 5 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX MEDLINE=22781984; PubMed=129 00447 ; DOI=10 . 1242/dev. 00674 ; 

RA Itasaki N. , Jones CM., Mercurio S. , Rowe A., Domingos P.M., 

RA Smith J.C., Krumlauf R.; 

RT "Wise, a context-dependent activator and inhibitor of Wnt 

RT signalling . " ; 

RL Development 13 0:4295-4305(2003). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons At tr ibut ion-NoDer ivs License 

CC 

DR EMBL; AY319927; AAQ57627.1; -; mRNA. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 
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DR PROSITE; PS012 25; CTCK_2 ; 1. 

SQ SEQUENCE 213 AA; 24 35 9 MW ; 32 72 FC53 57A4 70 ID CRC64 ; 

Query Match 73.3%; Score 817.5; DB 2; Length 213; 

Best Local Similarity 75.2%; Pred. No. 4.6e-71; 

Matches 152; Conservative 21; Mismatches 20; Indels 9; Gaps 3; 

Qy 10 LLPLACI LMKS CL AF KNDATE ILY SH WKP VP AH PS SN STLNQARNGGRH FSNTGLDRNT 69 

:| M M l-l II II II III II | : : : : || | | | | | | || | : :|: Ml 

Db 10 MLYLACI LI ESCVSF KNDATE ILY SH VDKH I - QDS ANSSTLNQARNGGRNAANSALDRTN 68 

Qy 70 RVQVGCRELRSTKYISDGQCTSISPLKELVCAGE CLPLPVLPNWIGGGYGTKYWSRRSSQ 12 9 

II II II II II II II II II II II II INI II II llhll II II II II lllllllll 
Db 69 HHQVGCRELRSTKYISDGQCTSIQPLKELVCAGECLPLPILPNWIGGGYGLKYWSRRSSQ 12 8 

Qy 130 E WR CVND KT RT QR I Q LQ CQ DG STR TY KI TV VT AC KC KR YT RQ HN ESSHNFESMSPAKPV- 18 8 

I II II II II II II II II hlhll II hll Ihl II II II II II II II I- Mill 
Db 129 EWRCVNDKTRTQRIQLQCEDGTTRTYKVTVVTTSCKCKRYTRQHNESSHNYQGASPIKPVH 18 8 

Qy 189 --QHH RERKRASKSSKH 203 

Ml h:|| I III 

Db 189 SHQHHHFHHNRDKKRLI KMSKH 210 



RESULT 7 

Q68 8C5_XENTR 

ID Q688C5_XENTR PRELIMINARY; PRT ; 213 AA . 

AC Q6 88C5; 

DT ll-OCT-2004, integrated into UniProtKB/TrEMBL . 

DT 11 -OCT -2004, sequence version 1. 

DT 07-FEB-2006, entry version 10. 

DE Wise -A. 

OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura ; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus ; Silurana. 

OX NCBI_TaxID=8 36 4; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA Qin S. , Dors M. # Johnson E., Bloom S., Hood L., Rowen L.; 

RT "Sequence of xenopus tropicalis development genes."; 

RL Submitted (SEP-2004) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AC151465; AAU12851.1; -; Genomic_DNA. 

DR Ensembl; ENSXETG00000022798 ; Xenopus tropicalis. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01225; CTCK_2 ; 1. 

SQ SEQUENCE 213 AA; 24400 MW; 80 18 13CF69A1 0AC2 CRC64 ; 

Query Match 72.8%; Score 811.5; DB 2; Length 213; 

Best Local Similarity 73.8%; Pred. No. 1.7e-70; 

Matches 14 9; Conservative 21; Mismatches 23; Indels 9; Gaps 3; 

Qy 10 LLPLACI LMKS CLAF KNDATE ILY SH WKP VP AH PS SNSTLNQARNGGRH FSNTGLDRNT 69 

\ III -I MM II II II III II I I : :::| llllllllll :|: Ml 
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Db 10 MLYFACI FIESCMSFKNDATEILYSHVDKNI -QESANSSALNQARNGGRHTANSAMDRTN 68 

Qy 70 R VQ VG CR ELRS TKYI SDGQ CT S I S PL KE LVCAGE CL PL P VLPNW IGGG YGTK YW SRRS SQ 129 

IIIIIIIIMIIIIIIIIIII II II II II II MM hll II II II II lllllllll 

Db 69 PHQVGCRELRSTKYISDGQCTSIQPLKELVCAGECLPLPILPNWIGGGYGLKYWSRRSSQ 12 8 

Qy 130 EWRCVNDKTRTQRIQLQCQDGSTRTYKITVVTACKCKRYTRQHNESSHNFESMSPAKPV- 18 8 

I II II II II II II II II hll Mill MM MM II II II II II II II Ml II ||: 

Db 129 E WR CVND KT RT QR IQLQCEDGTTR TY KVTV VT S C KC KR YT RQ HN E S S H NY EGAS PM KP I H 18 8 
Qy 189 --QHH RERKRAS KSSKH 203 

III MMI I Ml 

Db 189 SLQHHHSHHNRDKKRLI KMSKH 210 



RESULT 8 
Q6X4U2_XENLA 

ID Q6X4U2_XENLA PRELIMINARY; PRT ; 213 AA. 

AC Q6X4U2; 

DT 05-JUL-2004, integrated into UniProtKB/TrEMBL . 

DT 05-JUL-2004, sequence version 1. 

DT 07-FEB-2006, entry version 11. 

DE Uterine sensitizat ion-associated protein-1. 

GN Name=Usagl ; 

OS Xenopus laevis (African clawed frog) . 

0C Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

0C Amphibia; Batrachia; Anura ; Mesobatrachia ; Pipoidea; Pipidae; 

0C Xenopodinae; Xenopus; Xenopus. 

OX NCBI_TaxID=8 35 5; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RA 0' Shaughnessy R.F.L., Yeo W. , Gautier J., Jahoda C.A.B., 

RA Christ iano A.M.; 

RT "A novel secreted WNT agonist is a requirement for epithelial - 

RT mesenchymal interactions."; 

RL Submitted (MAR-2003) to the EMBL / GenBa nk /DDB J databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AY255636 ; AAQ832 98.1; -; mRNA. 

DR InterPro; IPR0 06 20 7; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS012 25; CTCK_2 ; 1. 

SQ SEQUENCE 213 AA; 24286 MW; 43 E0AFCF6A78FFC5 CRC64 ; 



Query Match 71.4%; Score 796.5; DB 2; Length 213; 

Best Local Similarity 73.8%; Pred. No. 5e-69; 

Matches 149; Conservative 22; Mismatches 22; Indels 9; Gaps 3; 
Qy 10 LLPLACI LMKSCLAFKNDATE ILY SH WKP VPAHPS SNSTLNQARNGGRHFSNTGLDRNT 69 

I II II MM MM II II II III III I : -Mill II II MM :|» ||| 

Db 10 MLYLACI LIES CVSFKNDATE ILYSH VDKH I - QDS ANSSTLNQARNGGRNAANSALDRTN 68 

Qy 70 RVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGTKYWSRRSSQ 12 9 

IIIIIIIIMIIIIIIIIIII llllllllllllll :| :||ll||| IIIIMIII 
Db 69 HHQVGCRELRSTKYI SDGQ CTSIQPLKELVCAGECLPLSILAHWIGGGYGLKYW SRRS SQ 12 8 

Qy 130 EWRCVNDKTRTQRIQLQCQDGSTRTYKITVVTACKCKRYTRQHNESSHNFESMSPAKPV- 18 8 
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I II II II II II II II II |:|hllll hll II :| II II II II II II II I- II III 
Db 129 EWRCVNDKTRTQRIQLQCEDGTTRTYKVTVVTSCKCKRYTRQHNESSHNYQGASPIKPVH 188 

Qy 189 --QHH RERKRAS KSSKH 203 

III h:|l I Ml 

Db 189 SHQHHHSHHNRDKKRLI KMSKH 2 10 



RESULT 9 
Q66KQ4_XENLA 

ID Q6 6KQ4_XENLA PRELIMINARY; PRT ; 18 7 AA. 

AC Q6 6 KQ4; 

DT ll-OCT-2004, integrated into UniProtKB/TrEMBL . 

DT ll-OCT-2004, sequence version 1. 

DT 07-FEB-2006, entry version 11. 

DE Hypothetical protein (Fragment). 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ,* 

OC Amphibia; Batrachia; Anura ; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus; Xenopus. 

OX NCBI_TaxID=8 355; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Whole; 

RX MEDLINE=22341132 ; PubMed=l 24 54 917 ; DOI=10 . 1002/dvdy . 1 0174 ; 

RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S.W., 

RA Richardson P.; 

RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus 

RT initiative."; 

RL Dev. Dyn. 225:384-391(2 002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Whole; 

RX MEDLINE=22 38 8257; PubMed=124 77932 ; DOI=10 . 1073/pnas . 242 6038 99 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S. , Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S. , Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16 899-169 03(2002). 

RN [3] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=Whole; 

RA Klein S. , Gerhard D.S.; 

RL Submitted (JUL-2004) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
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CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; BC07 85 98; AAH7 85 98.1; - ; mRNA. 

DR InterPro; IPR00620 7; Cys_knot__C. 

DR InterPro; IPR002400; GF_cysknot. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR149 03; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT. 

DR PROSITE; PS012 25; CTCK_2 ; 1. 

KW Hypothetical protein. 

FT NON_TER 1 1 

SQ SEQUENCE 187 AA; 21320 MW ; 46 70BF6C3F8D74 1A CRC64 ; 



Query Match 6 6.5%; Score 741.5; DB 2; Length 18 7; 

Best Local Similarity 75.5%; Pred . No. 9.6e-64; 

Matches 13 9; Conservative 15; Mismatches 21; Indels 9; Gaps 3; 

ATEILYSHVVKPVPAHPSSNSTLNQARNGGRHFSNTGLDRNTRVQVGCRELRSTKYISDG 87 

lllllll I I :-l III II II II h :|: III I I I I I I I I I I I I I I I I 

ATEILYSQVDKHT-QESANSSTLNQARNGGRNTANSALDRTNHHQVGCRELRSTKYISDG 6 0 



Qy 


28 


Db 


2 


Qy 


88 


Db 


61 


Qy 


148 


Db 


121 


Qy 


200 


Db 


181 



14 7 



Mill I II MM II II II IMI II II MM Mlllllllllllllllllllllllll 

QCTSI QPLKELVCAGECLPLP ILPNW IGGGYGLKYWSRRS SQEWRCVNDKTRTQRI QLQC 
QDGSTRTYKITWTACKCKRYTRQHNESSHNFESMSPAKPV- - - QHH RERKRASK 

HIM II M M II MM II II Ml II II II Ml M lh III MM 1 I 

EDGTTRTYKVTWTSCKCKRYTRQHNESSHNYEGASPIKPIHSQQHHHSHHNRDKKRLIK 

SSKH 203 
III 



RESULT 10 
Q56 8H6_BRARE 



ID 


Q5 68H6_BRARE PRELIMINARY; PRT ; 20 7 AA. 




AC 


Q568H6; 




DT 


10-MAY-2005, integrated into UniProtKB/TrEMBL . 




DT 


10-MAY-2005, sequence version 1. 




DT 


07-FEB-2006, entry version 4. 




DE 


Hypothetical protein zgc: 110293. 




GN 


ORFNames=zgc : 110293 ; 




OS 


Brachydanio rerio (Zebrafish) (Danio rerio) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi ; 


OC 


Actinopterygii ; Neopterygii; Teleostei; Ostariophysi 


; Cyprini f ormes ; 


OC 


Cyprinidae; Danio. 




OX 


NCBI TaxID=7955; 




RN 


[1] 




RP 


NUCLEOTIDE SEQUENCE. 




RC 


TISSUE=01f actory epithelium; 




RX 


MEDLINE=22388257; PubMed=l 24 77 932 ; DOI=10 . 1 073/pnas . 


242603899; 


RA 


Strausberg R.L., Feingold E . A. , Grouse L.H., Derge J 


.G. , 


RA 


Klausner R.D., Collins F.S., Wagner L., Shenmen CM. 


, Schuler G.D. , 


RA 


Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F 


'. , Bhat N.K. , 


RA 


Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J. 


, Hsieh F . , 


RA 


Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., 


Hong L . , 


RA 


Staple ton M. , Soares M.B., Bonaldo M.F., Casavant T. 


L . , Scheetz T .E . , 


RA 


Brownstein M.J., Usdin T.B., Toshiyuki S. , Carninci 


P . , Prange C. , 


RA 


Raha S.S., Loquellano N.A. , Peters G.J., Abramson R. 


D . , Mullahy S . J. , 
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RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I. # Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=01f actory epithelium; 

RG NIH MGC Project; 

RL Submitted (APR-2005) to the EMBL/GenBank/DDB J databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BC092855; AAH92855.1; -; mRNA. 

DR Ensembl; ENSDARG0000002622 7 ; Danio rerio. 

DR ZFIN; ZDB-GENE- 050 417 - 64 ; zgc:110293. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01225; CTCK_2 ; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 207 AA; 23503 MW; 07 1B9F3C99A8D8EB CRC64 ; 

Query Match 57.8%; Score 645; DB 2; Length 207; 

Best Local Similarity 64.7%; Pred. No. 2.6e-54; 

Matches 13 0; Conservative 20; Mismatches 37; Indels 14; Gaps 6; 

Qy 3 PPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHFSN 62 

I : :| : I I h>| I lllllll Mill II IhMhlhlll II 

Db 6 PESCNF- -MVLFCFLIRSGLTLKNDATEIFYSHWSPV-QDAQSNASLNRARSGGRGFST 62 

Qy 63 TGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGTKY 122 

II h Mill MM II III II II Ihhl III hill MINI 1 1 1 1 I 

Db 63 H--DRE -RIPVGCRELRSTKYI SDGQCTSINPVKGLVCTGQCLPAQMLPNWI -GGYGKKS 118 

Qy 123 WSRRS SQEWRCVNDKTRTQRIQLQ CQDGSTRTYKITWTACKCKRYTRQHNESSHNFESM 182 

I M MM MINI II II II M III II II II M II MM MM II MM II II I I 

Db 119 WNRRNSQEWRCVNDKTRTQRIQLQ CQDGSTRTYKITWTSCKCKRYSRQHNESGVKSEGY 17 8 

Qy 183 SPAKPVQHHRERKRASKSSKH 203 

I I ' I' I I I 

Db 179 S HSQIKKTEKQSGH 192 



RESULT 11 

Q6 VYA 1_XENLA 

ID Q6VYA1_XENLA PRELIMINARY; PRT ; 15 3 AA. 
AC Q6VYA1; 

DT 05-JUL-2004, integrated into UniProtKB/TrEMBL . 
DT 05-JUL-2004, sequence version 1. 
DT 07-FEB-2006, entry version 7. 
DE Wise-B (Fragment). 
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OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pi po idea; Pipidae; 

OC Xenopodinae; Xenopus; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX MEDLINE=22781984 ; PubMed=12900447 ; DOI=10 . 124 2/dev. 00674 ; 

RA Itasaki N. , Jones CM., Mercurio S. , Rowe A., Domingos P.M., 

RA Smith J.C., Krumlauf R.; 

RT "Wise, a context-dependent activator and inhibitor of Wnt 

RT signalling . " ; 

RL Development 13 0:42 95-4305(2003). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AY319928; AAQ57628.1; -; mRNA. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF0546 3; Sclerostin; 1. 

FT N0N_TER 15 3 153 

SQ SEQUENCE 153 AA; 17279 MW; E8C8 03C9277A8961 CRC64 ; 

Query Match 55.2%; Score 615.5; DB 2; Length 153; 

Best Local Similarity 74.7%; Pred. No. 1.4e-51; 

Matches 115; Conservative 15; Mismatches 23; Indels 1; Gaps 1; 

Qy 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

\ :| : |||||::|| HIIIIIIIIII I I : : : I I I I I I I I I I h 

Db 1 M WSRLH CCMF YLAC IL I E SCTSF KNDATE IL YSQVDKHT - QES ANSS TLNQ ARNGGRNT 5 9 

Qy 61 SNTGLDRNTRVQVGCRELRSTKYISDGQCTSI SPLKELVCAGECLPLPVLPNWIGGGYGT 120 

:h III II II II II II III II II II II llllllllllllllhllllllllll 

Db 60 ANS ALDRTNHHQVGCRELRSTKYI SDGQCTSI QPLKELVCAGECLPLP IL PNWI GGGYGL 119 

Qy 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRT 154 

I II II II II II II II II II II Ml II hlhl I 

Db 120 KYWSRRSSQEWRCVNDKTRTQRIQLQCEDGTTRT 153 



RESULT 12 
Q4RYK3JTETNG 

ID Q4RYK3JTETNG PRELIMINARY; PRT ; 179 AA . 

AC Q4RYK3; 

DT 19-JUL-2005, integrated into UniProt KB /TrEMBL . 

DT 19-JUL-2005, sequence version 1. 

DT 07-FEB-2006, entry version 4. 

DE Chromosome 2 SCAF14976, whole genome shotgun sequence. (Fragment). 

GN ORFNames=GSTENG00026927001 ; 

OS Tetraodon nigroviridis (Green puffer) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Tetraodont if ormes ; 

OC Tetradontoidea; Tetraodont idae ; Tetraodon. 

OX NCBI_TaxID=99883 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RX PubMed=154 96 914; DOI=10 . 1038 /nature 03 02 5 ; 

RA Jaillon O. , Aury J.-M., Brunet F. , Petit J.-L., Stange-Thomann N., 

RA Mauceli E. , Bouneau L. , Fischer C, Ozouf-Costaz C. , Bernot A., 
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RA Nicaud S., Jaffe D., Fisher S. , Lutfalla G. , Dossat C, Segurens B. , 

RA Dasilva C. , Salanoubat M . , Levy M. f Boudet N. , Castellano S., 

RA Anthouard V., Jubin C. , Castelli V., Katinka M. , Vacherie B., 

RA Biemont C. , Skalli Z., Cattolico L. , Poulain J. , De Berardinis V., 

RA Cruaud C, Duprat S., Brottier P., Coutanceau J. -P., Gouzy J., 

RA Parra G. , Lardier G. , Chappie C, McKernan K.J., McEwan P., Bosak S., 

RA Kellis M., Volff J.-N., Guigo R., Zody M.C., Mesirov J. , 

RA Lindblad-Toh K . , Birren B. , Nusbaum C. , Kahn D. , Robinson-Rechavi M., 

RA Laudet V., Schachter V., Quetier F. , Saurin W., Scarpelli C, 

RA Wincker P., Lander E.S., Weissenbach J. , Roest Crollius H.; 

RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals 

RT the early vertebrate proto -karyotype ." ; 

RL Nature 431:946-957(2 004). 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RG Genoscope; Whitehead Institute Centre for Genome Research; 

RL Submitted (FEB-2004) to the EMBL/GenBank/DDB J databases. 

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDB J whole genome shotgun (WGS) entry which is 

CC preliminary data. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; CAAE01 0149 76 / CAG06529.1; -; Genomic_DNA. 

DR InterPro; IPR0 0620 7; Cys_knot_C. 

DR InterPro; IPR0 02 40 0; GF_cysknot. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR149 03 ; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT. 

DR SMART; SM00041; CT ; 1. 

DR PROSITE; PS01225; CTCK_2 ; 1. 

FT NON_TER 17 9 179 

SQ SEQUENCE 179 AA; 19644 MW; C5 69E3 1E64 2A2D25 CRC64 ; 

Query Match 33.4%; Score 372.5; DB 2; Length 179; 

Best Local Similarity 44.6%; Pred . No. 7.1e-28; 

Matches 78; Conservative 31; Mismatches 45; Indels 21; Gaps 6; 

Qy 14 ACILMKSCL AFKND AT E I LY S HWKPVPAHPSSNSTLNQARNGGRHF 60 

I :|:: I I lllllhl I h llh hl-l I 

Db 13 ALVLL QG CC AAARGW KALKND ATE VL AD DRERSP HE PAAHAANAS SNAG -NRAKSGARST 71 

Qy 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 

I : llllllhl-ll I I hllllhhIM ::|| II 
Db 72 STVSYSAS ELSCRELRSTRYVTDGSCRSAKPIKELVCSGQCLPAHLMPNTIG RA 125 

Qy 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNES 175 

hi I h hill :| :||lhl :hllllll IhlMlh Ih 
Db 126 KWW - RSSTSEYRCVP AHSRPRRI QLRC PNGNTRTYKI RTVTS CKCKRFRAHHNQS 179 

RESULT 13 
SOST^HUMAN 

ID SOST_HUMAN STANDARD; PRT ; 213 AA. 

AC Q9BQB4 ; 

DT 05-MAR-2002, integrated into UniProtKB/Swiss-Prot . 
DT 01-JUN-2001, sequence version 1. 
DT 07-MAR-2006, entry version 38. 
DE Sclerostin precursor. 
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GN Name=SOST; ORFNames=UNQ2 976/PR074 55/PR074 76 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chorda ta ; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontogli res; Primates; Catarrhini; Hominidae; 

OC Homo . 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE (I SO FORM 1) . 

RX PubMed=11181578; DOI =1 0 . 1093 /hmg/ 10 . 5 . 537 ; 

RA Balemans W., Ebeling M., Patel N. , van Hul E. , Olson P., Dioszegi M., 

RA Lacza C. , Wuyts W. , van den Ende J., Willems P., Paes-Alves A.F., 

RA Hill S., Bueno M., Ramos F.J., Tacconi P., Dikkers F.G., Stratakis C, 

RA Lindpaintner K., Vickery B., Foernzler D. , Van Hul W. ; 

RT "Increased bone density in sclerosteosis is due to the deficiency of a 

RT novel secreted protein (SOST)."; 

RL Hum. Mol . Genet. 10:537-543(2001). 

RN [2] 

RP NUCLEOTIDE SEQUENCE (ISOFORM 1). 

RX MEDLINE=21090529; PubMed=l 11 79 006 ; 

RA Brunkow M.E., Gardner J.C., Van Ness J., Paeper B.W., Kovacevich B.R., 

RA Proll S., Skonier J.E., Zhao L. f Sabo P.J., Fu Y.H., Alisch R.S., 

RA Gillett L. , Colbert T. , Tacconi P., Galas D., Hamersma H. , 

RA Beighton P., Mulligan J.T.; 

RT "Bone dysplasia sclerosteosis results from loss of the SOST gene 

RT product, a novel cystine knot-containing protein."; 

RL Am. J. Hum. Genet. 68:577-589(2001). 

RN [3] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 1 AND 2) . 

RX MEDLINE=22887296 ; PubMed=12975309 ; DOI= 10 . 1 10 1/gr . 129 30 03 ; 

RA Clark H.F., Gurney A.L., Abaya E. , Baker K. , Baldwin D.T., Brush J., 

RA Chen J., Chow B. , Chui C, Crowley C. , Currell B. , Deuel B. , Dowd P., 

RA Eaton D. , Foster J.S., Grimaldi C, Gu Q. , Hass P.E., Heldens S., 

RA Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., 

RA Lewis L. , Liao D., Mark M.R., Robbie E. , Sanchez C. , Schoenfeld J . , 

RA Seshagiri S. , Simmons L. , Singh J., Smith V., Stinson J., Vagts A., 

RA Vandlen R.L., Watanabe C, Wieand D., Woods K. , Xie M.-H., 

RA Yansura D.G., Yi S., Yu G. , Yuan J., Zhang M. , Zhang Z. , Goddard A.D., 

RA Wood W.I., Godowski P.J., Gray A.M.; 

RT "The secreted protein discovery initiative (SPDI) , a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinf ormatics assessment."; 

RL Genome Res. 13:2265-227 0(2003). 

RN [4] 

RP PROTEIN SEQUENCE OF 24-38. 

RX PubMed=15340161; DOI =1 0 . 11 10 /ps . 046 82 504 ; 

RA Zhang Z. , Henzel W.J.; 

RT "Signal peptide prediction based on analysis of experimentally 

RT verified cleavage sites."; 

RL Protein Sci. 13:2819-2824(2004). 

CC -!- FUNCTION: Seems to play a role in bone homeostasis. 

CC -!- SUBCELLULAR LOCATION: Secreted protein (Potential). 

CC - ! - ALTERNATIVE PRODUCTS : 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q9BQB4-l; Sequence=Displayed ; 

CC Name=2; 

CC IsoId=Q9BQB4-2; Sequence =VSP_0 10 18 9 ; 

CC Note=No experimental confirmation available; 

CC -!- TISSUE SPECIFICITY: Widely expressed at low levels with highest 
CC levels in bone, cartilage, kidney, liver, bone marrow and primary 

CC osteeoblasts differentiated for 21 days. 
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CC -!- DISEASE: Defects in SOST are the cause of sclerosteosis 

CC [MIM : 269500] . Sclerosteosis is an autosomal recessive sclerosing 

CC bone dysplasia characterized by progressive skeletal overgrowth. 

CC The disorder is similar to van Buchem hyperostosis cortical is 

CC generalisata [MIM: 23 9100] , but differing in radiologic appearance 

CC of the bone changes and in the presence of asymmetric cutaneous 

CC syndactyly of the index and middle fingers in many cases. The 

CC majority of affected individuals have been reported in the 

CC Afrikaner population of South Africa, where a high incidence of 

CC the disorder occurs as a result of a founder effect. 

CC -!- SIMILARITY: Belongs to the sclerostin family. 

CC -!- SIMILARITY: Contains 1 CTCK (C- terminal cystine knot -like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AF331844; AAK16158.1; -; mRNA. 

DR EMBL; AF326736; AAK13451.1; - ; GenomicJDNA. 

DR EMBL; AF326739; AAK13454.1; -; mRNA. 

DR EMBL; AY358203; AAQ88570.1; -; mRNA. 

DR EMBL; AY358627; AAQ88990.1; -; mRNA. 

DR Ensembl; ENSG00000 16 7941 ; Homo sapiens. 

DR HGNC; HGNC: 13771; SOST. 

DR MIM; 2 6950 0; phenotype . 

DR MIM; 6 05740; gene. 

DR GO; GO:0030279; P:negative regulation of ossification; NAS. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01185; CTCK_1 ; FALSE_NEG . 

DR PROSITE; PS01225; CTCK_2 ; FALSE_NEG . 

KW Alternative splicing; Direct protein sequencing; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 




FT 


CHAIN 


24 


213 


Sclerostin . 


FT 








/FTId=PRO_0000033177 . 


FT 


DOMAIN 


82 


172 


CTCK. 


FT 


CARBOHYD 


53 


53 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


175 


175 


N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


64 


73 


RPPHHPFETK -> WPGGRPPSRAPLST (in isoform 


FT 








2) . 


FT 








/FTId=VSP_010189 . 


SQ 


SEQUENCE 


213 AA; 


24031 


MW; 30DBD55CE73D5BB2 CRC64 ; 



Query Match 31.3%; Score 34 9.5; DB 1; Length 213; 

Best Local Similarity 40.5%; Pred . No. 1.5e-25; 



Matches 85; Conservative 33; Mismatches 65; Indels 27; Gaps 10; 

Qy 11 LPLA CILMKSCL AFKNDATEIL- - YSH WKPVPAHPSSNSTLNQARN 55 

I I I I hh : MINIMI: :| | :| |:|:| | 

Db 3 LPLALCLVCLLVHTAFRWEGQGWQAFKNDATEIIPELGEYPEP-PPELENNKTMNRAEN 61 

Qy 56 GGR--HFSNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNW 113 

Ml I I : : I I I I |:|::|| I I h lllhhl I MM 
Db 62 GGRPPHHPFETKDVS EYSCRELHFTRYVTDGPCRSAKP VTELVCSGQCGPARLLPNA 118 

Qy 114 IGGGYGTKYWSRRSSQEWRCVNDKTRTQRIQLQCQDG-STRTYKITVVTACKCKRYTRQH 172 

II I hill ::|h 1= I Ihll I I : I h =1 =11111 II I 

Db 119 IGRG KWW-RPSGPDFRCIPDRYRAQRVQLLCPGGEAPRARKVRLVASCKCKRLTRFH 174 

Qy 173 NES SHNFESMS PAKP VQHHRERKRAS KS SK 2 02 
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hi hi : : I II :hl 

Db 175 NQS EL KD FGTE AARP QKGR KP RPR A - RSA K 203 

RESULT 14 
Q49 5N9_HUMAN 

ID Q4 95N9_HUMAN PRELIMINARY; PRT ; 213 AA . 

AC Q4 95N9; 

DT 13-SEP-2005, integrated into UniProtKB/TrEMBL . 

DT 13-SEP-2005, sequence version 1. 

DT 07 -FEB -2006, entry version 3. 

DE Sclerostin, . 

GN Name=SOST; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi / 

OC Mammalia; Eutheria; Euarchontogli res ; Primates; Catarrhini; Hominidae; 

OC Homo . 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=PCR rescued clones; 

RX MEDLINE=22388257; PubMed=124 77 932 ; DOI= 10 . 1 07 3/pnas . 242 6038 99 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J., Hsieh F . , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Browns te in M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences." ; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

• RP NUCLEOTIDE SEQUENCE. 

RC TISSUE=PCR rescued clones; 

RG NIH MGC Project; 

RL Submitted (AUG-2005) to the EMBL/GenBank/DDB J databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attr ibut ion-NoDerivs License 

CC 

DR EMBL; BC101087; AAI01088.1; -; mRNA. 

DR EMBL; BC101088; AAI01089.1; -; mRNA. 

DR EMBL; BC101086; AAI01087.1; -; mRNA. 

DR EMBL; BC101089; AAI01090.1; -; mRNA. 

DR InterPro; IPR0 06 207; Cys_knot_C. 

DR Pfam; PF05463; Sclerostin; 1. 

SQ SEQUENCE 213 AA; 24031 MW ; 30DBD5 5CE73D5BB2 CRC64 ; 

Query Match 31.3%; Score 34 9.5; DB 2; Length 213; 

Best Local Similarity 40.5%; Pred . No. 1.5e-25; 
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Matches 85; Conservative 33; Mismatches 65; Indels 27; Gaps 10; 

Qy 11 LP LA CILMKSCL AFKNDATEIL- - YSH WKPVPAHPSSNSTLNQARN 55 

I I I I hh : llllllMh I I M |:|:| I 

Db 3 LPLALCLVCLLVHTAFRWEGQGWQAFKNDATEI IPELGEYPEP-PPELENNKTMNRAEN 61 

Qy 56 GGR- -HFSNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNW 113 

III I I : : I I I I hh: II I I | = t I I I = I = I I MM 

Db 62 GGRPPHHPFETKDVS ---EYSCRELHFTRYVTDGPCRSAKPVTELVCSGQCGPARLLPNA 118 

Qy 114 I GGGYGTKYWSRRSSQEWRCVNDKTRTQRIQLQCQDG - STRTYKI TV VTACKCKRYTRQH 172 

II I Mill ::||: M | Ihll I : I M M MUM I I I 

Db 119 I GRG KWW - RPS GP DF RC I P DR YR AQ RVQL LC PGGE AP RARK VRLVA SC KG KRLT RFH 174 

Qy 173 NESSHNFESMS PAKP VQHHRERKRAS KS SK 2 02 

hi hi : : I II :hl 

Db 175 NQS EL KD FGTE AARP QKGR KP RPR A - RSAK 203 

RESULT 15 
SOST_CERAE 

ID SOST_CERAE STANDARD; PRT ; 213 AA. 

AC Q9BG78; 

DT 05-MAR-2002, integrated into Uni Prot KB /Swiss- Pro t . 

DT 01-JUN-2001, sequence version 1. 

DT 07-FEB-2006, entry version 26. 

DE Sclerostin precursor. 

GN Name=SOST; 

OS Cercopithecus aethiops (Green monkey) (Grivet). 

OC Eukaryota; Metazoa; Chorda ta ; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Primates; Catarrhini; 

OC Cercopithecidae; Cercopi thecinae; Cercopithecus . 

OX NCBI_TaxID=953 4 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [MRNA] . 

RX MEDLINE=21090529; PubMed=l 11 79 006 ; 

RA Brunkow M.E., Gardner J.C., Van Ness J., Paeper B.W., Kovacevich B.R., 

RA Proll S., Skonier J.E., Zhao L . , Sabo P.J., Fu Y.H., Alisch R.S., 

RA Gillett L. , Colbert T. , Tacconi P., Galas D., Hamersma H. , 

RA Beighton P., Mulligan J.T.; 

RT "Bone dysplasia sclerosteosis results from loss of the SOST gene 

RT product, a novel cystine knot -containing protein."; 

RL Am. J. Hum. Genet. 68:577-589(2 001). 

CC -!- FUNCTION: Seems to play a role in bone homeostasis (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Secreted protein (Potential). 

CC -!- SIMILARITY: Belongs to the sclerostin family. 

CC -!- SIMILARITY: Contains 1 CTCK (C-terminal cystine knot-like) domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDer ivs License 

CC 

DR EMBL; AF326742; AAK13457.1; -; mRNA. 

DR InterPro; IPR006207; Cys_knot_C. 

DR InterPro; IPR008835; Sclerostin. 

DR PANTHER; PTHR14903 ; Sclerostin; 1. 

DR Pfam; PF05463; Sclerostin; 1. 

DR PROSITE; PS01185; CTCK_1 ; FALSE_NEG . 

DR PROSITE; PS01225; CTCK_2 ; FALSE_NEG. 

KW Glycoprotein; Signal. 

FT SIGNAL 1 23 Potential. 
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FT CHAIN 24 213 Sclerostin. 

FT /FTId=PRO_0000033176 . 

FT DOMAIN 82 172 CTCK . 

FT CARBOHYD 53 53 N-linked (GlcNAc. . .) (Potential). 

FT CARBOHYD 175 175 N-linked (GlcNAc. . .) (Potential). 

SQ SEQUENCE 213 AA; 23 90 8 MW; 6DA7B5EDE6 74 72 8A CRC64 ; 

Query Match 31.2%; Score 34 8; DB 1; Length 213; 

Best Local Similarity 40.5%; Pred . No. 2.1e-25; 

Matches 83; Conservative 31; Mismatches 65; Indels 26; Gaps 9; 

Qy 11 LPLA CILMKSCL AFKNDATEIL - - YSHWKPVPAHPSSNSTLNQARN 55 

I I I I hh : MINIMI: :| | :| |:|:| | 

Db 3 LPLALCLVCLLVHAAFRWEGQGWQAFKNDATEI IPELGEYPEP-PPELENNKTMNRAEN 61 

Qy 56 GGR--HFSNTGLDRNTRVQVGCRELRSTKYISDGQCTSISPLKELVCAGECLPLPVLPNW 113 

Ml I I : : MM h h M I I I I 1 1 1 h I : I I MM 

Db 62 GGRPPHHPFETKDVS EYSCRELHFTRYVTDGPCRSAKPVTELVCSGQCGPARLLPNA 118 

Qy 114 IGGGYGTKYWSRRSSQEWRCVNDKTRTQRIQLQCQDGST-RTY KI TVVTACKCKRYTRQH 172 

M I Mill :»||: h I II M I I h I h • I MINI II I 
Db 119 IGRG KWW-RPSGPDFRCIPDRYRAQRVQLLCPGGAAPRARKVRLVASCKCKRLTRFH 174 

Qy 173 NESSHNFESMS PAKPVQHHRERKRA 19 7 

hi 1=1 : I II 

Db 175 NQS EL KD FG PE AARP QKGR KP RPR A 19 9 



Search completed: August 12, 2006, 07:00:54 
Job time : 303 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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SCORE Search Results Details for Application 
10627604 and Search Result us-10-627-604- 
2,rag- 

Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 10627604 and Search Result us-10- 

627-604-2.rag. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 



August 12, 2006, 06:52:05 ; Search time 196 Seconds 

(without alignments) 
480.544 Million cell updates/sec 

US-10-627-604-2 
1115 

1 MLPPAIHFYLLPLACILMKS PVQHHRERKRASKSSKHSMS 206 



Scoring table: BLOSUM6 2 

Gapop 10.0 , Gapext 0 . 5 

Searched: 2589679 seqs, 457216429 residues 

Total number of hits satisfying chosen parameters: 



2589679 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



Database : A_Geneseq_8 : * 

1: geneseqpl980s : * 
2: geneseqpl990s : * 
3 : geneseqp2000s : * 
4: geneseqp2001s : * 
5 : geneseqp2002s :* 
6: geneseqp2003as: * 
7: geneseqp2003bs : * 
8: geneseqp2004s : * 
9 : geneseqp2005s : * 
10: geneseqp2006s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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% 

Result Query 



No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


1115 


100 


.0 


206 


2 


AAW09408 


Aaw09408 


Human sma 


2 


1115 


100 


.0 


206 


2 


AAW9571 1 


Aaw95711 


Homo sapi 


3 


1115 


100 


. 0 


206 


3 


AAB10233 


Aabl0233 


Human adu 


4 


1115 


100 


.0 


206 


5 


ABB84851 


Abb84851 


Human PRO 


5 


1115 


100 


.0 


206 


5 


ABB09210 


Abb09210 


SCGF CNN 


6 


1115 


100 


.0 


206 


5 


ABB95457 


Abb95457 


Human ang 


7 


1115 


100 


.0 


206 


7 


ADD1035 9 


Addl0359 


Human sec 


8 


1115 


100 


.0 


206 


7 


ADD11 3 1 9 


Addll319 


Human sec 


9 


1115 


100 


.0 


206 


7 


ADD37 112 


Add37112 


Human sec 


10 


1115 


100 


.0 


206 


8 


ADE41320 


Ade41320 


Human sec 


11 


1 115 


100 


.0 


206 


8 


ADH43 503 


Adh43503 


Human PRO 


12 


1115 


100 


.0 


206 


8 


ADI27203 


Adi27203 


Human LRP 


13 


1115 


100 


.0 


206 


8 


ADK70477 


Adk70477 


Respirato 


14 


1115 


100 


.0 


206 


8 


ADK82848 


Adk82 84 8 


Human PRO 


15 


1115 


100 


.0 


206 


8 


ADQ26 750 


Adq26 75 0 


Form proc 


16 


1115 


100 


.0 


206 


9 


ADX07493 


Adx07493 


Cyclin-de 


17 


1115 


100 


.0 


206 


9 


ADY74 83 9 


Ady74 83 9 


Human par 


18 


1109 


99 


.5 


206 


2 


AAW58704 


Aaw58704 


Human sma 


19 


1109 


99 


.5 


206 


5 


AAU82653 


Aau82653 


Human bre 


2 0 


1109 


99 


.5 


206 


5 


ABB09200 


Abb09200 


Human sma 


21 


1088 


97 


.6 


23 0 


7 


ADB6544 8 


Adb65448 


Human pro 


2 2 


106 7 


95 


.7 


206 


3 


AAY76 03 1 


Aay76 031 


Murine sk 


23 


1067 


95 


.7 


206 


3 


AAY75 981 


Aay75981 


Murine sk 


24 


1067 


95 


.7 


206 


4 


AAB55 92 0 


Aab55920 


Skin cell 


25 


106 7 


95 


.7 


206 


4 


AAB55970 


Aab55970 


Skin cell 


26 


1067 


95 


.7 


206 


5 


ABB72170 


Abb7217 0 


Murine pr 


27 


1067 


95 


.7 


206 


5 


ABB72120 


Abb7212 0 


Murine pr 


28 


1067 


95 


.7 


206 


8 


ADS85127 


Ads85127 


Mouse a to 


29 


1067 


95 


.7 


206 


9 


ADY74 83 8 


Ady74 83 8 


Mouse par 


30 


1062 


95 


.2 


206 


8 


ADI27204 


Adi27204 


Rat LRP b 


3 1 


1058 


94 


.9 


206 


8 


ADI27 144 


Adi27144 


Mouse LRP 


32 


1055 


94 


.6 


206 


9 


ADY74 83 7 


Ady74 83 7 


Rat parti 


33 


992 


89 


.0 


183 


5 


ABB07213 


Abb07213 


Human clo 


34 


974 


87 


.4 


182 


5 


AAU82 655 


Aau82655 


Human bre 


35 


969 


86 


.9 


206 


8 


ADI27 15 1 


Adi27151 


Chicken L 


36 


719 


64 


.5 


13 4 


6 


ABR4743 8 


Abr47438 


Breast ca 


37 


719 


64 


.5 


134 


8 


ADS85025 


Ads85025 


Human a to 


3 8 


510.5 


45 


.8 


126 


8 


ADI27205 


Adi27205 


Zebraf ish 


39 


482. 5 


43 


.3 


116 


2 


AAW44 09 0 


Aaw44 09 0 


Human sec 


40 


482. 5 


43 


.3 


116 


2 


AAW27 654 


Aaw27654 


Secreted 


41 


406 


36 


.4 


77 


8 


ADI27214 


Adi27214 


Mouse LRP 


42 


355.5 


31 


.9 


213 


8 


ADI27150 


Adi2715 0 


Human LRP 


43 


349.5 


31 


.3 


213 


3 


AAY96429 


Aay96429 


Human TGF 


44 


349. 5 


31 


.3 


213 


3 


AAY96436 


Aay96436 


Human TGF 


45 


349.5 


31 


.3 


213 


3 


AAY96 43 0 


Aay9643 0 


Human TGF 



ALIGNMENTS 



RESULT 1 
AAW09408 

ID AAW094 08 standard; protein; 206 AA. 
XX 

AC AAW0 94 08; 
XX 

DT 03-SEP-1997 (first entry) 
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xx 

DE Human small CCN-like growth factor. 
XX 

KW CCN; human small CNN-like growth factor; SCGF; family; growth regulator; 
KW CTGF; connective tissue; cef 10; cry 61; nov; receptor; identify; 
KW antagonist; diagnosis; tumour; vascular disease; neovascularisation . 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Peptide 1. .23 

FT /label= signal_pept ide 

FT Protein 24. .206 

FT /label= mature_protein 

XX 

PN W09639486-A1. 
XX 

PD 12-DEC-1996. 
XX 

PF 05-JUN-1995; 95WO-US0 07 092 . 
XX 

PR 05-JUN-1995; 95WO-US0 07092 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Hastings GA, Adams MD; 
XX 

DR WPI; 1997-043109/04. 
DR N-PSDB; AAT47661. 
XX 

PT DNA encoding human small CCN-like growth factor polypeptide - useful to 

PT diagnose, e.g. tumours. 

XX 

PS Claim 1; Fig 1; 61pp; English. 
XX 

CC The present sequence is that of human small CNN-like growth factor 

CC (SCGF) . SCGF is similar to the CCN family of growth regulators, where CCN 

CC stands for CTGF (connective tissue growth factor) , cef 10/cry 61, and 

CC nov. The SCGF receptor can be used to identify (ant ) agonists to SCGF. 

CC Disease or a susceptibility to disease related to a mutation of the DNA 

CC encoding SCGF (AAT47661) can be diagnosed by determining a mutation in 

CC the DNA. Detection of the mutation will allow diagnosis of, e.g. a 

CC tumour. An anti-SCGF antibody could be used to diagnose vascular disease 

CC or neovascularisation associated with tumour formation 

XX 

SQ Sequence 2 06 AA; 

Query Match 100.0%; Score 1115; DB 2; Length 206; 

Best Local Similarity 100.0%; Pred. No. 1.3e-109; 

Matches 20 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

I II II II II MM II II Ml I Ml II II II II II II II II II II II II II II II MM II 

Db 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

Qy 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 

I M II II MM II II II II M Ml II II II II II II II II II II M MM M II MMII 

Db 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLPLPVLPNWIGGGYGT 12 0 

Qy 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 18 0 

I II M M II MM II II M II III II II II II II II II II M II II II II II II II II II 
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SCORE Search Results Details for Application 
10627604 and Search Result us-10-627-604- 

2-rai. 



Score Home 
Page 



Retrieve Application 
List 



SCORE System 
Overview 



SCORE 
FAQ 



Comments / 
Suggestions 



This page gives you Search Results detail for the Application 10627604 and Search Result us-10- 

627-604-2.rai. 

start 



Go Back to previous page 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: August 12, 2006, 07:01:10 ; 



Search time 50 Seconds 
(without alignments) 
360.626 Million cell updates/sec 



Title : 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-10-627-604-2 
1115 

1 MLPPAIHFYLLPLACILMKS . 



. PVQHHRERKRASKSSKHSMS 2 06 



BLOSUM6 2 
Gapop 10.0 



Gapext 0 . 5 



650591 seqs, 87530628 residues 



650591 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1 : /EMC_Celerra_SIDS3/ptodata/2/iaa/5_C0MB.pep: * 

2 : /EMC_Celerra_SIDS3/ptodata/2/iaa/6_COMB.pep: * 

3 : /EMC_Celerra_SIDS3/ptodata/2/iaa/7__COMB.pep: * 

4 : /EMC_Celerra_SIDS3/ptodata/2/iaa/H_COMB.pep: * 

5 : /EMC_Celerra_SIDS3/ptodata/2/iaa/PCTUS_COMB.pep: * 

6 : /EMC_Celerra_SIDS3 /ptodata/2 / iaa/RE_COMB .pep : * 

7 : /EMC_Celerra_SIDS3/ptodata/2/iaa/backf ilesl . pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 
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1 


1115 


100 


.0 


206 


1 


US 


-08 


-468 


-847B-20 


Sequence 


20, Appl 


2 


1109 


99 


.5 


206 


1 


US 


-08 


-468 


-847B-2 


Sequence 


2, Appli 


3 


1088 


97 


.6 


230 


2 


US 


-10 


-104 


-047-3602 


Sequence 


3602, Ap 


4 


1067 


95 


.7 


206 


2 


US 


-09 


-188 


-930-159 


Sequence 


159, App 


5 


1067 


95 


.7 


206 


2 


US 


-09 


-188 


-930-286 


Sequence 


2 86, App 


6 


1067 


95 


.7 


206 


2 


US 


-09 


-312 


-283C-159 


Sequence 


15 9, App 


7 


1067 


95 


.7 


206 


2 


US 


-09 


-312 


-283C-286 


Sequence 


28 6, App 


8 


349.5 


31 


.3 


213 


2 


US 


-09 


-449 


-218D-2 


Sequence 


2, Appli 


9 


349.5 


31 


.3 


213 


2 


US 


-09 


-449 


-218D-6 


Sequence 


6, Appli 


10 


349.5 


31 


.3 


213 


2 


US 


-09 


-449 


-218D-8 


Sequence 


8, Appli 


11 


349.5 


31 


.3 


213 


2 


US 


-09 


-668 


-529A-2 


Sequence 


2, Appli 


12 


349.5 


31 


.3 


213 


2 


US 


-09 


-668 


-529A-6 


Sequence 


6, Appli 


13 


349.5 


31 


.3 


213 


2 


us 


-09 


-668 


-529A-8 


Sequence 


8, Appli 


14 


349.5 


31 


.3 


213 


2 


us 


-09 


-668 


-037A-2 


Sequence 


2, Appli 


15 


349. 5 


31 


.3 


213 


2 


us 


-09 


-668 


-037A-6 


Sequence 


6, Appli 


16 


349.5 


31 


.3 


213 


2 


us 


-09 


-668 


-037A-8 


Sequence 


8, Appli 


17 


349.5 


31 


.3 


213 


2 


us 


-09 


-668 


-021-2 


Sequence 


2 , Appl i 


18 


349.5 


31 


.3 


213 


2 


us 


-09 


-668 


-021-6 


Sequence 


6 , Appl i 


19 


349.5 


31 


.3 


213 


2 


us 


-09 


-668 


-021-8 


Sequence 


8 , Appl i 


20 


348 


31 


.2 


213 


2 


us 


-09 


-449 


-218D-10 


Sequence 


10, Appl 


21 


348 


31 


.2 


213 


2 


us 


-09 


-668 


-529A-10 


Sequence 


10, Appl 


22 


348 


31 


.2 


213 


2 


us 


-09 


-668 


-037A-10 


Sequence 


10 , Appl 


23 


348 


31 


.2 


213 


2 


us 


-09 


-668 


-021-10 


Sequence 


10, Appl 


24 


340 


30 


.5 


213 


2 


us 


-09 


-449 


-218D-14 


Sequence 


14, Appl 


25 


340 


30 


.5 


213 


2 


us 


-09 


-668 


-529A-14 


Sequence 


14, Appl 


26 


340 


30 


.5 


213 


2 


us 


-09 


-668 


-037A-14 


Sequence 


14, Appl 


27 


340 


30 


.5 


213 


2 


us 


-09 


-668 


-021-14 


Sequence 


14, Appl 


28 


333 


29 


.9 


176 


2 


us 


-09 


-449 


-218D-16 


Sequence 


16 , Appl 


29 


333 


29 


.9 


176 


2 


us 


-09 


-668 


-529A-16 


Sequence 


16, Appl 


30 


333 


29 


.9 


176 


2 


us 


-09 


-668 


-037A-16 


Sequence 


16 , Appl 


31 


333 


29 


.9 


176 


2 


us 


-09 


-668 


-021-16 


Sequence 


16, Appl 


32 


333 


29 


.9 


211 


2 


us 


-09 


-449 


-218D-12 


Sequence 


12 , Appl 


33 


333 


29 


.9 


211 


2 


us 


-09 


-668 


-529A-12 


Sequence 


12 , Appl 


34 


333 


29 


.9 


211 


2 


us 


-09 


-668 


-037A-12 


Sequence 


12 , Appl 


35 


333 


29 


.9 


211 


2 


us 


-09 


-668 


-021-12 


Sequence 


1 2 , App 1 


36 


95. 5 


8 


.6 


375 


1 


us 


-08 


-468 


-847B-13 


Sequence 


13, Appl 


37 


95.5 


8 


.6 


375 


2 


us 


-09 


-495 


-448A-33 


Sequence 


33, Appl 


38 


95 


8 


.5 


379 


1 


us 


-08 


-468 


-847B-11 


Sequence 


11, Appl 


39 


95 


8 


.5 


379 


2 


us 


-09 


-142 


-569-2 


Sequence 


2 , Appl i 


40 


95 


8 


.5 


379 


2 


us 


-09 


-495 


-448A-2 


Sequence 


2, Appli 


41 


95 


8 


.5 


379 


2 


us 


-10 


-099 


-322-45 


Sequence 


4 5, Appl 


42 


95 


8 


.5 


379 


2 


us 


-10 


-044 


-564-45 


Sequence 


45, Appl 


43 


91.5 


8 


.2 


735 


2 


us 


-09 


-191 


-647-9 


Sequence 


9 , Appl i 


44 


91.5 


8 


.2 


735 


2 


us 


-09 


-540 


-245A-9 


Sequence 


9, Appli 


45 


91.5 


8 


.2 


735 


2 


us 


-09 


-540 


-153-9 


Sequence 


9 , Appl i 



ALIGNMENTS 



RESULT 1 

US-08-468-847B-20 

; Sequence 20, Application US/08468847B 

; Patent No. 5780263 

; GENERAL INFORMATION: 

; APPLICANT: Hastings, Gregg A. and Adams, Mark D. 
TITLE OF INVENTION: Human CCN-Like Growth Factor 
NUMBER OF SEQUENCES: 20 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILIAN, 
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ADDRESSEE: CECCHI , STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
CITY : ROSELAND 
STATE: NEW JERSEY 
COUNTRY: USA 
ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
SOFTWARE : WORD PERFECT 5 . 1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/468, 847B 
FILING DATE: 6 June 1995 
CLASSIFICATION: 4 35 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY /AGENT INFORMATION: 
NAME: MULLINS, J.G. 
REGISTRATION NUMBER: 33,073 
REFERENCE/ DOCKET NUMBER: 325800-442 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX : 201-994-174 4 
INFORMATION FOR SEQ ID NO: 20: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2 06 AMINO ACIDS 
TYPE: AMINO ACID 
STRANDEDNESS: 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-08-468-847B-20 

Query Match 100.0%; Score 1115; DB 1; Length 206; 

Best Local Similarity 100.0%; Pred. No. 6.2e-118; 

Matches 206; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

MIIIMIIII II II MM Mill llllll II llllllll llllll II II MM llllll 

Db 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

Qy 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLP LP VLPNWIGGGYGT 12 0 

I II II II II M M M II II II III II II II II II II II II II II II II II II II II II II 
Db 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLPLP VLPNWIGGGYGT 12 0 

Qy 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 180 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ! 1 1 i 1 1 1 1 1 1 [ I 

Db 121 KYWSRRSSQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 180 

Qy 181 SMS PAKPVQHHRERKRASKSS KHSMS 2 06 

I II II II M II II II II MM Ml II 

Db 181 SMS PAKP VQHHRE RKRASKSS KHS MS 2 06 



RESULT 2 

US-08-468-847B-2 

; Sequence 2, Application US/08468847B 

; Patent No. 5780263 

; GENERAL INFORMATION: 

APPLICANT: Hastings, Gregg A. and Adams, Mark D. 
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SCORE Search Results Details for Application 
10627604 and Search Result us-10-627-604- 
2.rapbm. 



Score Home 
Page 



Retrieve Application 
List 



SCORE System 
Overview 



SCORE 
FAQ 



Comments / 
Suggestions 



This page gives you Search Results detail for the Application 10627604 and Search Result us-10- 

627-604-2. rapbm. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: August 12, 2006, 07:13:05 ; Search time 177 Seconds 

(without alignments) 
539.109 Million cell updates/sec 

US-10-627-604-2 
Perfect score: 1115 

1 MLP PA IH FY LL PL AC ILMKS PVQHHRERKRAS KS SKHSMS 2 06 



Title: 
Per fee 
Sequence : 

Scoring table: 
Searched: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 

2097797 seqs, 463214858 residues 



2097797 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 

Database : Published_Applications_AA_Main: * 

1 : /EMC_Celerra_SIDS3 /ptodata/2 /pubpaa/US07_PUBCOMB .pep : * 

2 : /EMC_Celerra_SIDS3 /ptodata/2 /pubpaa/US08_PUBCOMB .pep : * 

3 : /EMC_Celerra_SIDS3 /ptodata/2 /pubpaa/US09_PUBCOMB .pep : * 

4 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10A_PUBCOMB.pep:* 

5 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10B_PUBCOMB.pep:* 

6 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/USll_PUBCOMB .pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB 



ID 



Description 



http://es/ScoreAccessWeb/GetItem.action?AppId=10627604&seqId=583749&Item^ 8/15/06 



SCORE Search Results Details for Application 10627604 and Search Result us- 10-627-6... Page 2 of 21 



1 


1115 


100 


. 0 


206 


3 


US - 


09 


-853 


-625B-20 


Sequence 


20, Appl 


2 


1115 


100 


. 0 


206 


4 


US - 


10 


-223 


-085 


-70 


Sequence 


7 0 , App 1 


3 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-084 


-70 


Sequence 


7 0 , App 1 


4 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-088 


-70 


Sequence 


7 0 , App 1 


5 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-090 


-70 


Sequence 


70, Appl 


6 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-087 


-70 


Sequence 


70, Appl 


7 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-083 


-70 


Sequence 


70, Appl 


8 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-089 


-70 


Sequence 


7 0, Appl 


9 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


- 081 


-70 


Sequence 


70, Appl 


1 0 


1115 


100 


. 0 


206 


4 


us - 


10 


-223 


-082 


-70 


Sequence 


70, Appl 


1 1 


1115 


100 


. 0 


206 


4 


us - 


10 


-305 


-654 


-70 


Sequence 


7 0 , App 1 


12 


1115 


100 


. 0 


206 


4 


us - 


10 


-464 


-368 


-104 


Sequence 


1 04 , App 


13 


1115 


100 


. 0 


206 


4 


us - 


10 


-081 


-056 


-70 


Sequence 


7 0 , App 1 


14 


1115 


100 


. 0 


206 


4 


us - 


10 


-627 


-604 


-2 


Sequence 


2 , Appl i 


15 


1115 


100 


. 0 


206 


5 


us - 


10 


-627 


-604 


-2 


Sequence 


2 , Appl i 


16 


1109 


99 


. 5 


206 


3 


us - 


09 


-853 


-625B-2 


Sequence 


2 , App 1 i 


17 


1109 


99 


. 5 


206 


4 


us - 


10 


-453 


-919 


-119 


Sequence 


119, App 


18 


108 8 


97 


. 6 


23 0 


4 


us - 


10 


-104 


-047 


-3602 


Sequence 


3 602, Ap 


19 


108 8 


97 


. 6 


23 0 


6 


us - 


11 


-072 


-512 


-3602 


Sequence 


3 602, Ap 


2 0 


1067 


95 


. 7 


206 


3 


us - 


09 


-866 


-050A-159 


Sequence 


15 9, App 


2 1 


1067 


95 


. 7 


206 


3 


us - 


09 


-866 


-050A-286 


Sequence 


28 6, App 


22 


1062 


95 


. 2 


206 


4 


us - 


10 


-464 


-368 


-105 


Sequence 


1 05 , App 


23 


105 8 


94 


. 9 


206 


4 


us - 


10 


-464 


-368 


-45 


Sequence 


4 5 , App 1 


24 


992 


89 


. 0 


183 


3 


us - 


09 


-867 


-274 


-25 


Sequence 


2 5 , App 1 


25 


992 


89 


. 0 


183 


4 


us - 


10 


-679 


-670A-25 


Sequence 


25, Appl 


26 


98 0 


87 


. 9 


182 


4 


us - 


10 


-453 


-919 


-121 


Sequence 


121, App 


27 


969 


86 


. 9 


206 


4 


us - 


10 


-464 


-368 


-52 


Sequence 


52, Appl 


28 


719 


64 


. 5 


134 


4 


us - 


10 


-177 


-293 


-108 


Sequence 


108, App 


29 


510 . 5 


45 


. 8 


126 


4 


us - 


10 


-464 


-368 


-106 


Sequence 


106, App 


30 


406 


36 


. 4 


77 


4 


us - 


10 


-464 


-368 


-115 


Sequence 


1 15 , App 


3 1 


355 . 5 


31 


. 9 


213 


4 


us - 


10 


-464 


-368 


-51 


Sequence 


51, Appl 


32 


349 . 5 


31 


. 3 


213 


3 


us - 


09 


-867 


-274 


-5 


Sequence 


5 , Appli 


3 3 


349.5 


31 


. 3 


213 


4 


us - 


10 


-245 


-752 


-64 


Sequence 


64, Appl 


34 


349.5 


31 


. 3 


213 


4 


us - 


10 


-245 


-859 


-64 


Sequence 


6 4 , App 1 


3 5 


349.5 


3 1 


. 3 


213 


4 


us - 


10 


-245 


-103 


-64 


Sequence 


64, Appl 


3 6 


349.5 


3 1 


. 3 


213 


4 


us - 


10 


-245 


-107 


-64 


Sequence 


64, Appl 


3 7 


349.5 


3 1 


. 3 


213 


4 


us - 


10 


-245 


-143 


-64 


Sequence 


6 4 , App 1 


3 8 


349.5 


31 


. 3 


213 


4 


US- 


10 


-245 


-771 


-64 


Sequence 


6 4 , App 1 


39 


349.5 


31 


. 3 


213 


4 


us - 


10 


-245 


-851 


-64 


Sequence 


64, Appl 


4 0 


349.5 


31 


. 3 


213 


4 


us - 


10 


-245 


-883 


-64 


Sequence 


6 4 , App 1 


41 


349. 5 


31 


.3 


213 


4 


us- 


10 


-237 


-535 


-64 


Sequence 


6 4 , App 1 


42 


349. 5 


31 


.3 


213 


4 


us- 


10 


-238 


-183 


-64 


Sequence 


6 4 , App 1 


43 


349. 5 


31 


.3 


213 


4 


us- 


10 


-238 


-283 


-64 


Sequence 


6 4 , App 1 


44 


349.5 


31 


.3 


213 


4 


us- 


10 


-238 


-370 


-64 


Sequence 


6 4 , App 1 


45 


349.5 


31 


.3 


213 


4 


us- 


10 


-245 


-055 


-64 


Sequence 


64, Appl 



ALIGNMENTS 



RESULT 1 

US-09-853 -625B-20 

; Sequence 20, Application US/09853625B 
; Patent No. US20020049304A1 

GENERAL INFORMATION: 
; APPLICANT: Hastings, Gregg A. and Adams, Mark D. 

TITLE OF INVENTION: Human CCN-Like Growth Factor 
NUMBER OF SEQUENCES: 2 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILIAN, 
CECCHI, STEWART & OLSTEIN 
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STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 

COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 

SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/0 9/ 853 , 62 5B 

FILING DATE: 14 -May -2001 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/053,587 

FILING DATE: 
ATTORNEY /AGENT INFORMATION: 

NAME: MULLINS , J.G. 

REGISTRATION NUMBER: 33,073 

REFERENCE/DOCKET NUMBER: 325800-442 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
INFORMATION FOR SEQ ID NO: 20: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 2 06 AMINO ACIDS 

TYPE: AMINO ACID 

STRANDEDNESS: 

TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 

SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
US-09-853 -625B-20 

Query Match 100.0%; Score 1115; DB 3; Length 206; 

Best Local Similarity 100.0%; Pred. No. 2.5e-106; 

Matches 206; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

I II MM II II MM II II Mill Ml! II II II II INI II II II II II llllll II II 

Db 1 MLPPAIHFYLLPLACILMKSCLAFKNDATEILYSHVVKPVPAHPSSNSTLNQARNGGRHF 60 

Qy 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQ CT SI SPLKELVCAGECLPLPVLPNWI GGGYGT 12 0 

I II II II II II II II II II II III II II II II II II II II II II II II II II II II II II 
Db 61 SNTGLDRNTRVQVGCRELRSTKYI SDGQCTSI SPLKELVCAGECLPLPVLPNWI GGGYGT 120 

Qy 121 KYWSRRS SQEWRCVNDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 18 0 

I II II M II II II M II II II III II II II II M II M II II II M II II II II M II II 

Db 121 KYWSRRS SQEWRCWDKTRTQRIQLQCQDGSTRTYKITWTACKCKRYTRQHNESSHNFE 180 

Qy 181 SMSPAKPVQHHRERKRASKSSKHSMS 2 06 

II II II II II II M II II II III II 
Db 181 SMSPAKPVQHHRERKRASKSSKHSMS 2 06 



RESULT 2 

US-10-223-085-70 

; Sequence 70, Application US/10223085 

; Publication No. US20030 1004 97A1 

; GENERAL INFORMATION: 

; APPLICANT: Baker, Kevin P. 
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SCORE Search Results Details for Application 1C 
and Search Result us-10-627-604-2.rapbi 



Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments / Sugg 

This page gives you Search Results detail for the Application 10627604 and Search Result us-10-6; 
start 

Go Ba 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



August 12, 2006, 07:13:40 



US-10-627-604-2 
1115 

1 MLP PA IH FY LL PL AC I LMKS . 



; Search time 32 Seconds 
(without alignments) 
433.321 Million cell updates/sec 



. PVQHHRERKRASKSSKHSMS 2 06 



BLOSUM6 2 
Gapop 10.0 



Gapext 0 . 5 



239914 seqs, 67312017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



239914 



Database 



Published_Application 
SIDS3 
SIDS3 
SIDS3 
SIDS3 
SIDS3 
SIDS3 
SIDS3 
SIDS3 



/EMC_Celerra_ 
/EMC_Celerra_ 
/EMC_Celerra~ 
/EMC_Celerra_ 
/EMC_Celerra_ 
/EMC_Celerra_ 
/EMC_Celerra_ 
/EMC_Celerra_ 



s_AA_New : * 

/p toda ta /2 /pubpaa/US 09_NEW_PUB . pep : * 
/ptodata /2 /pubpaa/US 06_NEW_PUB .pep : * 
/ptodata /2 /pubpaa/US 07_NEW_PUB .pep : * 
/ptodata /2 /pubpaa/US 08_NEW_PUB .pep : * 
/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 
/ptodata/2/pubpaa/US10_NEW_PUB .pep: * 
/ptodata/2/pubpaa/USll_NEW_PUB .pep : * 
/ptodata /2 /pubpaa/US 60_NEWJ?UB .pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 118.5 10.6 2233 6 US-10-544 -731 -9 

2 102.5 9.2 2202 6 US-10-544 -731 -8 



Sequence 9, Appli 
Sequence 8, Appli 
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3 


102 . 5 


9 


. 2 


222 7 


6 


US - 


10 


-544 


-731 -7 


Sequence 


7 , Appl i 


4 


89 


8 


. 0 


984 


6 


US - 


10 


-449 


-902 -564 54 


Sequence 


5 64 54 , A 


5 


84 


7 


. 5 


250 


7 


us - 


11 


-056 


-355B-7514 


Sequence 


75 14 , Ap 


6 


82 


7 


. 4 


5179 


7 


us - 


11 


-105 


-233 -185 


Sequence 


IOC « nn 

185, App 


7 


80 


7 


. 2 


422 


6 


us - 


10 


-449 


-902 -442 14 


Sequence 


44214 , A 


8 


7 7 


6 


. 9 


29 8 


6 


us - 


10 


-449 


ft ft ft f> r\f-i r\ a 

- 902 -397 08 


Sequence 


ft Q T ft O jl 


9 


77 


6 


. 9 


35 9 


6 


us - 


10 


-511 


r\ ft o ft ^ n 

-937 -293 2 


Sequence 


ft ft ft ft 7V _ 

2 932, Ap 


1 0 


7 7 


6 


. 9 


35 9 


6 


us - 


10 


-511 


ft ft ft ft ft c C 

-937 -295 5 


Sequence 


ft ft C C 7\ 

2 95b, Ap 


1 1 


77 


6 


. 9 


50 0 


7 


us - 


11 


-254 


-195 - 6 


Sequence 


6 , Appl i 


12 


76.5 


6 


. 9 


83 8 


6 


us - 


10 


-449 


oat, yicoco 

-902 -452 52 


Sequence 


4 52 52 , A 


13 


74.5 


6 


. 7 


354 


7 


us - 


11 


-056 


-355B- 167 95 


Sequence 


16795, A 


14 


74.5 


6 


. 7 


356 


7 


us - 


11 


-056 


-355B- 167 94 


Sequence 


16 794 , A 


15 


74 . 5 


6 


. 7 


38 6 


7 


us - 


11 


- 056 


-355B- 167 93 


Sequence 


16793, A 


16 


74.5 


6 


. 7 


78 0 


6 


us - 


10 


-449 


ft ft ft a a ft, ft 0 

- 902 -440 82 


Sequence 


4 4082, A 


17 


73 


6 


. 5 


468 


7 


us - 


11 


-293 


-697 -4473 


Sequence 


4 473, Ap 


18 


72.5 


6 


. 5 


723 


7 


us - 


11 


-289 


-102 -290 


Sequence 


2 90, App 


1 9 


71 


6 


. 4 


215 


6 


us - 


10 


-953 


lift 1 1 ft, "i """J 

-349-310 32 


Sequence 


ft -i ft ft ft TV 

31032, A 


2 0 


7 1 


6 


. 4 


224 


6 


us - 


10 


-449 


ft ft ft <-i r\ r\ a 

-902 -37984 


Sequence 


3 7984 , A 


2 1 


7 1 


6 


. 4 


224 


6 


us - 


10 


-449 


ft ft ft a ft a ft ft 

- 902 -494 22 


Sequence 


4 94 22 , A 


22 


7 1 


6 


. 4 


62 7 


7 


us - 


11 


- 056 


- 355B- 709 95 


Sequence 


70995, A 


23 


7 1 


6 


. 4 


74 7 


7 


us - 


11 


- 056 


-355B-709 94 


Sequence 


70994, A 


24 


7 1 


6 


. 4 


783 


7 


us - 


11 


- 056 


ft r- r~ n ft ft m 

-355B-709 93 


Sequence 


»7 ft ft ft ft TV 

70993, A 


25 


70.5 


6 


. 3 


39 0 


6 


us - 


10 


- 953 


-349 -244 21 


Sequence 


2 44 21, A 


26 


70.5 


6 


. 3 


390 


7 


us - 


11 


- 056 


-355B-575 08 


Sequence 


57508, A 


27 


70.5 


6 


. 3 


392 


6 


us - 


10 


-953 


-349 -244 20 


Sequence 


244 20, A 


28 


70.5 


6 


. 3 


392 


7 


us - 


11 


-056 


-355B-575 07 


Sequence 


57507, A 


29 


70 . 5 


6 


. 3 


590 


6 


us - 


10 


-953 


-349 -988 1 


Sequence 


9 881, Ap 


3 0 


70.5 


6 


. 3 


590 


7 


us - 


11 


-056 


-355B-20740 


Sequence 


ft ft ftT X ft TV 

20740, A 


3 1 


70.5 


6 


. 3 


604 


6 


us - 


10 


-953 


-349 -9880 


Sequence 


9 880, Ap 


32 


70.5 


6 


. 3 


604 


7 


us - 


11 


-056 


-355B-20739 


Sequence 


ft ft »-? ft ft Tl 

20739, A 


3 3 


70.5 


6 


. 3 


60 7 


6 


us - 


10 


- 953 


ft a ft r\ n *7 o. 

-349 - 987 9 


Sequence 


ft ft ^T ft TV _. 

9 879, Ap 


34 


70.5 


6 


. 3 


60 7 


7 


us - 


11 


- 056 


-\ ^ r— t"\ o ft n ^ n 

-355B- 207 38 


Sequence 


ft ft *T ft O TV 

20738, A 


35 


70.5 


6 


. 3 


84 7 


7 


us - 


11 


-174 


ft ft »-7 »-} ft 

-307B-70 


Sequence 


70, Appl 


3 6 


6 9.5 


6 


. 2 


204 


7 


us - 


11 


-330 


^ ft ft ft x i r- 

-403 -8415 


Sequence 


8415, Ap 


3 7 


6 9.5 


6 


. 2 


364 


7 


us - 


11 


- 056 


ft r- r- t-j ft ft ft ft 1 

-355B- 200 81 


Sequence 


ft ft ft ft -1 * 

20 081, A 


3 8 


6 9.5 


6 


. 2 


382 


7 


us - 


11 


-056 


<s C r r> ft ft ft ft ft 

- 355B-200 80 


Sequence 


ft ft ft ft ft tv 

20080, A 


3 9 


69.5 


6 


. 2 


405 


7 


us - 


11 


- 056 


-355B-20079 


Sequence 


20079, A 


A A 


6 9.5 


6 


. 2 


89 6 


7 


us - 


11 


- 056 


nrPTj T A 1 A/1/1 

- Sddd- 101 04 4 


Sequence 


1 AT A /I /I 

1U 10 44 , 


41 


69.5 


6 


.2 


896 


7 


us- 


11 


-056 


-355B-112283 


Sequence 


112283, 


42 


69.5 


6 


.2 


901 


7 


us- 


11 


-056 


-355B-101043 


Sequence 


101043, 


43 


69.5 


6 


.2 


901 


7 


us- 


11 


-056 


-355B-112282 


Sequence 


112282 , 


44 


69.5 


6 


.2 


902 


7 


us- 


11 


-056 


-355B-101042 


Sequence 


101042, 


45 


69.5 


6 


.2 


902 


7 


us- 


11 


-056 


-355B-112281 


Sequence 


112281, 



ALIGNMENTS 



RESULT 1 
US-10-544-731-9 

; Sequence 9, Application US/10544731 
; Publication No. US2 00 6 015 02 62 Al 
; GENERAL INFORMATION : 

; APPLICANT: Applied Research Systems ARS Holding N.V. 
; TITLE OF INVENTION: NOVEL MUCIN-LIKE POLYPEPTIDES 

FILE REFERENCE: 82 5-PCT 
/ CURRENT APPLICATION NUMBER: US/10/5 44,731 
; CURRENT FILING DATE: 20 05-08-05 
; PRIOR APPLICATION NUMBER: US 6 0/445,217 
; PRIOR FILING DATE: 2003-02-05 
; NUMBER OF SEQ ID NOS : 9 



http://es/ScoreAccessWeb/GetItem.action?AppId=10627604&seqId=583750&U^ 8/15/06 



SCORE Search Results Details for Application 10627604 and Search Result us-1 0-627-6... Page 3 of 13 



; SOFTWARE: Patentln version 3.2 
; SEQ ID NO 9 

LENGTH: 2 233 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-10-544-731-9 

Query Match 10.6%; Score 118.5; DB 6; Length 2233 ; 

Best Local Similarity 21.6%; Pred. No. 0.0034; 

Matches 51; Conservative 24; Mismatches 94; Indels 67; Gaps 7; 

Qy 12 PLACILMKSCL AFKNDATEI LYSH WKP VPAHPS SNSTLN 51 

Ihl hH : I I I - I I I I I 

Db 2005 PLSCSLVRSRIPAPAKGGFTPRWVWGAVIIPAAPADTPSCLGLSTPEPGPMSPSLTSVGA 2064 

Qy 52 QARNGGRHFSNTGLDRNTRVQVGCREL RSTKYISDGQCTSISP 94 

II : | : | : I hi I 

Db 2065 AERLGTEGAPLSAQDEARMSKDGCCRFCPPPPPPYQNQSTCAVYHRSLIIQQQGCSSSEP 2124 

Qy 95 LKELVCAGECLPLPVLPNWIGGGYGTKYWSRRSSQEWR CVNDKTRTQRIQLQCQDGS 151 

-III I : I := | | | :| : : | | M| 

Db 2125 VRLAYCRGNC GDSSSMYSLEGNTVEHRCQCCQELRTSLRNVTLHCTDGS 2173 

Qy 152 TRTYKITWTACKC - - KRY TRQHNE SSHNFESMSPAKPVQHH 191 

:| : I I I I H Ihl | ::| I h II 

Db 2174 SRAFSYTEVEECGCMGRRCPAPGDTQHSEEAEPEPSQEAESGSWERGVPVSPMHHH 2229 



RESULT 2 
US-10-544-731-8 

; Sequence 8, Application US/10544731 
; Publication No. US20060150262A1 
; GENERAL INFORMATION: 

; APPLICANT: Applied Research Systems ARS Holding N.V. 
; TITLE OF INVENTION: NOVEL MUCIN-LIKE POLYPEPTIDES 
; FILE REFERENCE: 825-PCT 

; CURRENT APPLICATION NUMBER: US/10/5 44,731 

; CURRENT FILING DATE: 2005-08-05 

; PRIOR APPLICATION NUMBER: US 6 0/445,217 

; PRIOR FILING DATE: 2003-02-05 

; NUMBER OF SEQ ID NOS : 9 

SOFTWARE: Patentln version 3.2 
; SEQ ID NO 8 

LENGTH: 2 202 

TYPE: PRT 

ORGANISM: homo sapiens 
US-10-544-731-8 

Query Match 9.2%; Score 102.5; DB 6; Length 22 02; 

Best Local Similarity 21.0%; Pred. No. 0.14; 

Matches 49; Conservative 24; Mismatches 93; Indels 67; Gaps 7; 

Qy 12 PLACILMKSCL AFKNDATEI LYSH WKP VP AH PS SNSTLN 51 

Ihl h:| : III HI II I 

Db 1980 PLSCSLVRSRIPAPAKGGFTPRWVWGAVIIPAAPADTPSCLGLSTPEPGPMSPSLTSVGA 2039 

Qy 52 QARNGGRHFSNTGLDRNTRVQVGCREL RSTKYISDGQCTSISP 94 

II : I : II I 1 = 1 I 

Db 2040 AERLGTEGAPLSAQDEARMSKDGCCRFCPPPPPPYQNQSTCAVYHRSLIIQQQGCSSSEP 2099 

Qy 95 L KE L V CAGE CL PL P VL P NW I G GGY GT KY WS RR S S QE WR CVN DKTRTQ RI QL QC QD GS 151 
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SCORE Search Results Details for Application 
10627604 and Search Result us-10-627-604- 

2.rapm. 



Score Home 
Page 



Retrieve Application 
List 



SCORE System 
Overview 



SCORE 
FAQ 



Comments / 
Suggestions 



This page gives you Search Results detail for the Application 10627604 and Search Result us-10- 

627-604-2. rapm. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: August 12, 2006, 07:01:50 ; 



Search time 60 6 Seconds 
(without alignments) 
519.064 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 

Scoring table: 



US-10-627-604-2 
1115 

1 MLP PA I H FY LL PL AC I LMKS , 



. PVQHHRERKRASKSSKHSMS 206 



BLOSUM62 
Gapop 10.0 



Gapext 0 . 5 



Searched: 8366291 seqs, 1526956180 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



8366291 



Database 



Pending_ Patent s_AA_Ma 

1 : /EMC_Celerra_SIDS3 

2 : /EMC_Celerra_SIDS3 

3 : /EMC_Celerra_SIDS3 

4 : /EMC_Celerra_SIDS3 

5 : /EMC_Celerra_SIDS3 

6 : /EMC_Celerra_SIDS3 

7 : /EMC_Celerra_SIDS3 

8 : /EMC_Celerra_SIDS3 

9 : /EMC_Celerra_SIDS3 

10: /EMC_Celerra_SIDS 

11: /EMC_Celerra_SIDS 

12 : /EMC_Celerra_SIDS 

13 : /EMC_Celerra_SIDS 

14 : /EMC_Celerra_SIDS 

1 5 : /EMC_Cel er ra_S IDS 

16 : /EMC_Celerra_SIDS 



in:* 

/ptodata/2/paa/PCTUS 
/ptodata/2/paa/US066 
/ptodata/2/paa/US0 73 
/ptodata/2/paa/US0 74 
/ptodata/2/paa/US0 75 
/ptodata/2/paa/US0 76 
/ptodata/2/paa/US0 77 
/ptodata/2/paa/US078 
/ptodata/2/paa/US0 79 
3/ptodata/2/paa/US08 
3/ptodata/2/paa/US08 
3/ptodata/2/paa/US08 
3/ptodata/2/paa/US08 
3/ptodata/2/paa/US08 
3/ptodata/2/paa/US08 
3/ptodata/2/paa/US08 



COMB. pep: * 
COMB. pep: * 
COMB. pep: * 
_COMB.pep: * 
_COMB. pep: * 
COMB. pep 



: * 

COMB. pep: * 
~COMB.pep: * 
_COMB.pep: * 
0_COMB .pep 
l_COMB .pep 
2 _ CO MB .pep 
3_COMB .pep 
4 _ CO MB .pep 
5_COMB .pep 
6 _ COMB .pep 
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17 : /EMC_CelerrajSIDS3/ptodata/2/paa/US087_COMB .pep : * 

18 : /EMC_Celerra_SIDS3/ptodata/2/paa/US088_COMB .pep : * 

19: /EMC_Celerra_SIDS3/ptodata/2/paa/US089_COMB .pep:* 

20: /EMC_Celerra_SIDS3/ptodata/2/paa/US090_COMB.pep:* 

2 1 : /EMC_Celerra_SIDS3/ptodata/2/paa/US091_COMB .pep : * 

2 2 : /EMC_Cel er ra_S IDS 3 /pt odat a/ 2 / paa/US 09 2_COMB . pep : * 

2 3 : /EMC_Cel er ra_S IDS 3 / pt odat a/ 2 /paa/US 09 3_COMB . pep : * 

24 : /EMC_CelerrajSIDS3/ptodata/2/paa/US094_COMB.pep:* 

25: /EMC_Celerra_SIDS3/ptodata/2/paa/US095_COMB.pep :* 

26: /EMC_Celerra_SIDS3/ptodata/2/paa/US096_COMB.pep:* 

2 7 : /EMC_Celerra_SIDS3/ptodata/2/paa/US097_COMB .pep : * 
28 : /EMC_Celerra_SIDS3/ptodata/2/paa/US098_COMB .pep : * 
29 : /EMC_Celerra_SIDS3/ptodata/2/paa/US099_COMB .pep : * 
30: /EMC_Celerra_SIDS3/ptodata/2/paa/US100_COMB.pep:* 
3 1 : /EMC_Cel er ra_S IDS3 /pt odat a/ 2/paa/US 10 l_COMB . pep : * 

3 2 : /EMC_Cel er ra_S IDS 3 /pt odat a/ 2/paa/US 10 2_COMB . pep : * 
33 : /EMC_Celerra_SIDS3/ptodata/2/paa/US103_COMB.pep:* 
34 : /EMC_Celerra_SIDS3/ptodata/2/paa/US104_COMB.pep:* 
35 : /EMC_Celerra_SIDS3/ptodata/2/paa/US105_COMB .pep : * 
36 : /EMC_Celerra_SIDS3/ptodata/2/paa/US106_COMB .pep : * 
37 : /EMC_Celerra_SIDS3/ptodata/2/paa/US107_COMB .pep : * 
38 : /EMC_Celerra_SIDS3/ptodata/2/paa/US108__COMB .pep : * 
39: /EMC_Celerra_SIDS3/ptodata/2/paa/US109_COMB.pep:* 
40 : /EMC_Celerra_SIDS3/ptodata/2/paa/US110_COMB .pep : * 
41: /EMC_Celerra_SIDS3/ptodata/2/paa/USlll_COMB .pep:* 
42: /EMC_Celerra_SIDS3/ptodata/2/paa/US112_COMB .pep:* 
43: /EMC_Celerra_SIDS3/ptodata/2/paa/US113_COMB.pep:* 
44 : /EMC_Celerra_SIDS3/ptodata/2/paa/US114_COMB .pep:* 
45 : /EMC_Celerra_SIDS3/ptodata/2/paa/US600_COMB .pep : * 

4 6 : /EMC_Cel er ra_S IDS 3 / pt odat a/ 2 / pa a/ US 6 0 l_COMB . pep : * 
4 7 : /EMC_Celerra_SIDS3/ptodata/2/paa/US602_COMB .pep : * 
4 8 : /EMC_Celerra_SIDS3/ptodata/2/paa/US603_COMB .pep : * 

4 9 : /EMC_Celerra_SIDS3/ptodata/2/paa/US604_COMB .pep : * 

5 0 : /EMC_Celerra_SIDS3/ptodata/2/paa/US605_COMB .pep : * 
51: /EMC_Celerra_SIDS3/ptodata/2/paa/US606_COMB.pep:* 
52: /EMC_Celerra__SIDS3/ptodata/2/paa/US607_COMB.pep:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



% 

Query 

Score Match Length DB ID 



Description 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 



1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 

1115 100.0 



206 31 US-10-170-205E-29147 



206 32 US-10-223-081-70 
206 32 US-10-223-082-70 



206 1 PCT-US98-13530-12 

206 1 PCT-US99-31005-17 

206 14 US-08-468-847-2 

206 18 US-08-885-610-12 

206 20 US-09-053-587-20 

206 22 US-09-220-876-17 

206 28 US-09-850-726-17 

206 28 US-09-853-625-20 

206 28 US-09-853-625B-20 

206 30 US-10-081-056-70 



206 1 PCT-US03-17409-213 
206 1 PCT-US04-24424-2058 



Sequence 213, App 
Sequence 2058, Ap 
Sequence 12, Appl 
Sequence 17, Appl 
Sequence 2, Appli 
Sequence 12, Appl 
Sequence 2 0, Appl 
Sequence 17, Appl 
Sequence 17, Appl 
Sequence 2 0, Appl 
Sequence 20, Appl 
Sequence 70, Appl 
Sequence 29147, A 
Sequence 70, Appl 
Sequence 70, Appl 
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16 


1115 


100 


. 0 


206 


32 


US- 


10 


-223 


-083 


-70 


Sequence 


70, Appl 


17 


1115 


100 


. 0 


206 


32 


US- 


10 


-223 


-084 


-70 


Sequence 


7 0 , App 1 


18 


1115 


100 


. 0 


206 


32 


US - 


10 


-223 


-085 


-70 


Sequence 


7 0 , App 1 


19 


1115 


100 


. 0 


206 


32 


US- 


10 


-223 


-087 


-70 


Sequence 


7 0 , App 1 


20 


1115 


100 


. 0 


206 


32 


US- 


10 


-223 


-088 


-70 


Sequence 


7 0 , App 1 


21 


1 115 


100 


. 0 


206 


32 


US- 


10 


-223 


- 089 


-70 


Sequence 


7 0 , App 1 


22 


1115 


100 


. 0 


206 


32 


US - 


10 


-223 


-090 


- 70 


Sequence 


7 0, App 1 


23 


1115 


100 


. 0 


206 


33 


US- 


10 


-305 


-654 


- 70 


Sequence 


7 0 , App 1 


24 


1115 


100 


. 0 


206 


34 


US- 


10 


-405 


-027 


-323 7 


Sequence 


3 23 7, Ap 


25 


1115 


100 


. 0 


206 


34 


US- 


10 


-405 


-027 


-457 3 


Sequence 


4 5 7 3 , Ap 


26 


1115 


100 


. 0 


206 


34 


US- 


10 


-464 


-368 


- 104 


Sequence 


1 04 , App 


27 


1115 


100 


. 0 


206 


35 


US- 


10 


-567 


-867 


-205 8 


Sequence 


2 058, Ap 


28 


1115 


100 


. 0 


206 


36 


US- 


10 


-627 


-604 


- 2 


Sequence 


2 , Appl i 


29 


1115 


100 


. 0 


206 


37 


US- 


10 


-743 


-643 


-207 1 


Sequence 


2 071, Ap 


30 


1115 


100 


. 0 


206 


49 


us- 


60 


-490 


-890 


-205 8 


Sequence 


2 058, Ap 


31 


1115 


100 


. 0 


223 


27 


US- 


09 


-758 


-444 


-401 


Sequence 


4 01, App 


32 


1115 


100 


. 0 


223 


32 


US- 


10 


-212 


-160 


-401 


Sequence 


4 01, App 


33 


1109 


99 


.5 


206 


20 


us- 


09 


-053 


-587 


-2 


Sequence 


2 , Appl i 


34 


1109 


99 


. 5 


206 


27 


us- 


09 


- 778 


-381 


-119 


Sequence 


119, App 


35 


1109 


99 


. 5 


206 


28 


US- 


09 


- 853 


-625 


-2 


Sequence 


2 , Appl i 


36 


1109 


99 


. 5 


206 


28 


us- 


09 


-853 


-625B-2 


Sequence 


2 , App 1 i 


37 


1109 


99 


. 5 


206 


34 


US- 


10 


-453 


-919 


-119 


Sequence 


119, App 


38 


1108 


99 


. 4 


206 


28 


US- 


09 


-834 


-366 


-19284 


Sequence 


1 92 84 , A 


39 


1108 


99 


. 4 


206 


46 


US- 


60 


- 197 


-873 


-19284 


Sequence 


192 84 , A 


40 


1100 . 5 


98 


. 7 


205 


28 


US- 


09 


- 853 


-625C-2 


Sequence 


2, Appli 


41 


1088 


97 


.6 


230 


40 


US- 


11 


-072 


-512 


-3602 


Sequence 


3602, Ap 


42 


1067 


95 


.7 


206 


20 


US- 


09 


-069 


-726 


-159 


Sequence 


159, App 


43 


1067 


95 


.7 


206 


28 


us- 


09 


-866 


-050A-159 


Sequence 


159, App 


44 


1067 


95 


.7 


206 


28 


us- 


09 


-866 


-050A-286 


Sequence 


28 6, App 


45 


1062 


95 


.2 


206 


34 


us- 


10 


-464 


-368 


-105 


Sequence 


105, App 



ALIGNMENTS 



RESULT 1 

PCT-US03-17409-213 

Sequence 213, Application PC/TUS0317409 
GENERAL INFORMATION: 
APPLICANT: INCYTE CORPORATION 
APPLICANT: RICKERT, Paula K . 
APPLICANT: KRASNOW, Randi 

TITLE OF INVENTION: DIAGNOSTIC MARKERS FOR LUNG CANCER 
FILE REFERENCE: PA- 0051 PCT 

CURRENT APPLICATION NUMBER: PCT/US03/ 17409 
CURRENT FILING DATE: 2003-06-02 
PRIOR APPLICATION NUMBER: US 60/386,005 
PRIOR FILING DATE: 2 002-06-04 
NUMBER OF SEQ ID NOS : 296 
SOFTWARE : PERL Program 
SEQ ID NO 213 
LENGTH: 2 06 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/ KEY : misc_f eature 

OTHER INFORMATION: Incyte ID No: 142 03 77CD1 
PCT-US03-17409-213 

Query Match 100.0%; Score 1115; DB 1; Length 206; 

Best Local Similarity 100.0%; Pred. No. 1.3e-118; 
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SCORE Search Results Details for Application 
10627604 and Search Result us-10-627-604- 
2,rapn. 



Score Home 
Page 



Retrieve Application 
List 



SCORE System 
Overview 



SCORE 
FAQ 



Comments / 
Suggestions 



This page gives you Search Results detail for the Application 10627604 and Search Result us- 10- 

627-604-2.rapn. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: August 12, 2006, 07:02:50 ; 



Search time 35 Seconds 
(without alignments) 
393.959 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-10-627-604 -2 
1115 

1 MLPPAIHFYLLPLACILMKS . 



PVQHHRERKRASKSSKHSMS 2 06 



BLOSUM6 2 
Gapop 10.0 



Gapext 0 . 5 



Searched: 298239 seqs, 66934740 residues 

Total number of hits satisfying chosen parameters: 



298239 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Pos t- pro cess ing : Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 

Database : Pending_Patent s_AA_New: * 

1 : /EMC_Celerra_SIDS3 /ptodata/2 /paa/PCT_NEW_COMB. pep: * 

2 : /EMC_Celerra_SIDS3 /ptodata/2 /paa/US06_NEW_COMB .pep : * 

3 : /EMC_Celerra_SIDS3 /ptodata/2 /paa/US07_NEW_COMB .pep : * 

4 : /EMC_Celerra_SIDS3 /ptodata/2 /paa/US0 8_NEW_COMB .pep : * 

5 : / EMC_Ce lerra_SIDS3 /ptodata/2 /paa/USO 9_NEW__C0MB . pep : * 

6 : /EMC_Celerra_SIDS3 /ptodata/2 /paa/USl 0_NEW_COMB .pep:* 

7 : /EMC_Celerra_SIDS3 /ptodata/2 /paa/USl 1_NEW_C0MB . pep:* 

8 : /EMC_Celerra_SIDS3/ptodata/2/paa/US6 0_NEW_COMB.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 
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No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


34 1 


30 


.6 


1 Q A 

19 0 


7 


T TO IT >l 1 1 AAO 1 

US -11-411 -003 -1 


Sequence 


1 , Appl i 


Z 


"3 A A 
34 0 


30 


.5 


213 


•-7 
/ 


TTO 11 /111 A AO GO 

Ub -11-411-003 - 9H 


Sequence 


98, Appl 


3 


9 5.5 


8 


.6 


375 


7 


US -11-455 -506 - 20 


Sequence 


20, Appl 


4 


95 


8 


.5 


O *7 A 

37 9 


1 


PCT-Ub0 6 -11960-555 


Sequence 


555, App 


5 


95 


8 


.5 


379 


6 


US -10-182-432A-2 


Sequence 


2, Appli 


6 


95 


8 


.5 


379 


7 


US -11-455-5 06 -2 


Sequence 


2 , Appl i 


7 


90 


8 


.1 


381 


1 


PCT-US06 - 058 12 - 1 


Sequence 


1, Appli 


8 


9 0 


8 


.1 


381 


6 


TTO 1 /> 1 O ^ A *\ *\ H A 

US -10-182-432A-4 


Sequence 


4, Appli 


9 


90 


8 


.1 


381 


7 


US -11-4 55 -506-4 


Sequence 


4 , Appl i 


10 


82 


7 


.4 


1 04 8 


7 


TTO 11 H r- ^\ *"r 1 O O 

US -11-475-062 -6109 


Sequence 


6109, Ap 


11 


82 


7 


.4 


149 0 


7 


US -11-475-062 -6110 


Sequence 


6110, Ap 


12 


8 2 


7 


.4 


2 83 9 


7 


TTO 11 / nr r\ C ^111 

US -11-475-062 - 6111 


Sequence 


6111, Ap 


13 


82 


7 


.4 


5 17 9 


7 


TTO 11 O 1 O y« -l O -IOC 

US -11-318 -418 -185 


Sequence 


185, App 


1 A 

1 4 


o 2 


7 


.4 




/ 


TTO 11 *5 1 O y| 1 O 1 Q C 

Ub -ll-31o -418 -lo5 


Sequence 


185, App 


1 C 
15 


O A 

82 


7 


.4 


517 9 


7 


T TO 11 /l*5*7 11Q 1 £ 

Ub -11-437-729-1766 


Sequence 


1766, Ap 


16 


82 


7 


.4 


5 179 


7 


US-ll-475-062-2277 


Sequence 


2277, Ap 


17 


81.5 


7 


.3 


393 


1 


PCT-US0 6-18535-325 05 


Sequence 


32505, A 


18 


81.5 


7 


.3 


393 


7 


US -11-431-855-325 05 


Sequence 


32505, A 


19 


81.5 


7 


.3 


393 


8 


US-60-815-535-199 


Sequence 


199, App 


2 0 


79 


7 


.1 


35 3 


1 


PCT-US0 6-185 35-5233 


Sequence 


5233, Ap 


2 1 


79 


7 


.1 


353 


7 


TTO 11 Jl *"l 1 OP*P" f™ O O 

US-11-431-855-5233 


Sequence 


5233, Ap 


22 


77.5 


7 


.0 


34 2 


1 


t^ /"i m tto^^ irtf~^^ oo^*r*vi 

PCT-US0 6-185 35 -226 54 


Sequence 


22654, A 


2 3 


7 7.5 


7 


.0 


34 2 


7 


TTO 11 /I n occ A A ^ c vi 

US -11-431-855-22654 


Sequence 


22654, A 


24 


7 7 


6 


.9 


42 6 


6 


US -10-534-579-20 


Sequence 


20, Appl 


2 5 


7 7 


6 


.9 


50 0 


7 


US -11 -490-374-377 


Sequence 


3 77, App 


26 


7 7 


6 


.9 


50 0 


7 


T TO 11 A e\l\ *") •"? /I 1 HQ 

US -11 -490-374-378 


Sequence 


3 78, App 


2 7 


7 7 


6 


.9 


50 0 


7 


tto ii » nn i i >i *>io 

US -11 -490-374-379 


Sequence 


3 79, App 


2 8 


7 6 


6 


.8 


604 


7 


TTO 11 1/IC A AO 1 1 

US -11-345 -403 -12 


Sequence 


12, Appl 


2 9 


75.5 


6 


.8 


1150 


1 


PCT- US 0 6-18535-24537 


Sequence 


24537, A 


3 0 


75 . 5 


6 


.8 


1150 


7 


US -11-431 -855 -24537 


Sequence 


24537, A 


3 1 


75 


6 


.7 


445 


7 


US -11-490-3 74 -3 76 


Sequence 


3 76, App 


32 


74 . 5 


6 


.7 


33 9 


7 


US -11-488-3 75 -32 


Sequence 


32, Appl 


33 


74.5 


6 


.7 


33 9 


7 


TTO II 4 AA i n r "1 ^* 

US-11-488-375-36 


Sequence 


36, Appl 


34 


74.5 


6 


.7 


354 


7 


US-11-488-375-37 


Sequence 


37, Appl 


35 


74.5 


6 


.7 


372 


6 


TTO Irt ^Ort >■ a f\ 

US -10-533-520-4748 


Sequence 


4748, Ap 


3 6 


74.5 


6 


.7 


372 


7 


TTO 11 A O O "J nr 

US -11-488-375-33 


Sequence 


33, Appl 


3 7 


74.5 


6 


.7 


402 


7 


T TO 11 ilTC A ^ A r ^ n » 

US -11-475-062 -5674 


Sequence 


5674, Ap 


"3 O 
J O 


n a c 
f 4 . D 


6 


.7 


A O O 

42 o 


/ 


TTO 1 1 A*7C C^*7^ 

Ub -11-4 /5 - 06 2 -bo / 6 


Sequence 


5676, Ap 


3 9 


74.5 


6 


.7 


42 8 


7 


t to ii ji q r ft^ri c ci e\ 

US -11-475- 06 2-5678 


Sequence 


5678, Ap 


/i a 


1 A C 

/ 4 . 5 


6 


.7 


/no 

42 o 


/ 


T TO 11 /I *7 C A ^ A r r q n 

Ub -11-4 /5 - 062 -56o 3 


Sequence 


5683, Ap 


41 


74.5 


6 


. 7 


1686 


1 


PCT-US06-04985-1 


Sequence 


1, Appli 


42 


73 


6 


.5 


555 


6 


US-10-533-520-3634 


Sequence 


3 634, Ap 


43 


72. 5 


6 


.5 


698 


7 


US-11-336-706-15 


Sequence 


15, Appl 


44 


72. 5 


6 


.5 


1038 


6 


US-10-670-914-91 


Sequence 


91, Appl 


45 


72 


6 


.5 


467 


6 


US-10-311-974A-9 


Sequence 


9, Appli 



ALIGNMENTS 



RESULT 1 
US-11-411-003-1 

Sequence 1, Application US/11411003 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Paszty, Christopher 
Robinson, Martyn Kim 
Graham, Kevin 
Henry, Ali stair James 
Hoffmann, Kelly Sue 
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APPLICANT: Latham, John 
APPLICANT: Lawson, Alastair 
APPLICANT: Lu , Hsieng Sen 
APPLICANT: Popplewell, Andy 
APPLICANT: Shen, Wenyan 
APPLICANT: Winkler, David 
APPLICANT: Winters, Aaron George 
TITLE OF INVENTION: Binding Agents 
FILE REFERENCE: 60117-224 

CURRENT APPLICATION NUMBER: US/l 1/4 11 , 0 03 
CURRENT FILING DATE: 2006-04-25 
PRIOR APPLICATION NUMBER: 
PRIOR FILING DATE: 2006-04-17 
PRIOR APPLICATION NUMBER: 60/782,244 
PRIOR FILING DATE: 2 006-03-13 
PRIOR APPLICATION NUMBER: 60/776,84 7 
PRIOR FILING DATE: 2006-02-24 
PRIOR APPLICATION NUMBER: 60/667,583 
PRIOR FILING DATE: 2005-05-03 
NUMBER OF SEQ ID NOS : 396 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 190 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-11-411-003 -1 

Query Match 3 0.6%; Score 341; DB 7; Length 19 0; 

Best Local Similarity 42.7%; Pred . No. 7.3e-27; 

Matches 79; Conservative 30; Mismatches 62; Indels 14; Gaps 8; 

Qy 23 AFKNDATEI L - - YSH WKPVPAHPS SNSTLNQARNGGR- - HFSNTGLDRNTRVQVGCREL 78 

Illlllllh I I :| |:hl I I I I I I • = I I I I 
Db 5 AFKNDATEI IPELGEYPEP-PPELENNKTMNRAENGGRPPHHPFETKDVS EYSCREL 60 

Qy 79 RSTKYISDGQCTSISPLKELVCAGECLPLPVLPNWIGGGYGTKYWSRRSSQEWRCVNDKT 13 8 

h|::|| I I h lllhhl I hi || I hill ::||: h 
Db 61 H FTRYVTDGPCRS AKPVTE LVCSGQCGP ARLL PNAI GRG KWW - RPS GP DFRC I PDRY 116 

Qy 139 RTQRIQLQCQDG-STRTYKI TVVTACKCKRYTRQHNESSHNFESMSPAKPVQHHRERKRA 197 

I Ihll I 1 = 1 h h -11111 II 11=1 hi = : I II 

Db 117 RAQRVQLLCPGGEAPRARKVRLVASCKCKRLTRFHNQSELKDFGTEAARPQKGRKPRPRA 176 

Qy 198 SKSSK 2 02 

:hl 

Db 177 -RSAK 180 



RESULT 2 

US-11-411-003-98 

Sequence 98, Application US/11411003 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Paszty, Christopher 
Robinson, Martyn Kim 
Graham, Kevin 
Henry, Ali stair James 
Hoffmann, Kelly Sue 
Latham, John 
Lawson, Alastair 
Lu, Hsieng Sen 
Popplewell, Andy 
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